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Executive Summary 


The Moderate Resolution Imaging Spectroradiometer (MODIS) semi-analytical (SA) algorithm calculates the 
spectral absorption properties of surface waters, splitting them into those associated with phytoplankton, 
aph(X), colored dissolved organic matter or gelbstoff, ag(X), and water, aw(X). The SA algorithm is designed to 
respond to variable ratios of ap h (X) to a^iX) and to wide ranges in the chlorophyll-specific phytoplankton 
absorption coefficient, a* ph (X), for a given chlorophyll-o level. 

The gelbstoff absorption coefficient (absorp_coef_gelb, parameter 30) can be used to map river plumes, 
determine diffuse attenuation coefficients, and calculate dissolved organic carbon (DOC) standing stocks and 
fluxes. The phytoplankton absorption coefficient, ap h (675) (chlor_absorb, parameter 3 1), is used to derive 
the concentration of chlorophyll-a (Chlor_a_3, parameter 27), a measure of the standing stock of living ocean 
carbon. Quantifying both phytoplankton and gelbstoff absorption coefficients accurately from space is 
critical for evaluating the global oceanic carbon budget. 

The SA algorithm also outputs total absorption coefficients, a(L,) (tot_absorb, parameters 32-36) at i=412, 
443, 488, 531, and 551nm which are used in the calculation of the absorbed radiation by phytoplankton 
(ARP, parameter 29). ARP is needed to convert fluorescence line height (FLH; see M. Abbott MODIS 
ATBD) into a value that represents the chlorophyll fluorescence efficiency (CFE) of the phytoplankton. In 
turn, CFE may be inversely proportional to the quantum yield of photosynthesis. While the use of FLH and 
CFE in estimating photosynthetic rates is the subject of much debate, the possibility of using satellites to 
measure primary production is enticing. The absorbed radiation by phytoplankton is also dependent upon the 
instantaneous photosynthetically available radiation (IPAR, parameter 28) (Gregg and Carder, 1990). 

The following website can be referred to for information regarding the ARP and IPAR data products: 
http://modis.gsfc.nasa.gov/data/atbd/atbd mod20.pdf . Data quality summaries for ARP and IPAR can be 
found at http ://modi s-ocean. gsfc .nasa. gov/aa / . Briefly, however, we found that MODIS Terra IPAR data are 
consistent with our May 2000 measurements in the Bahamas. MODIS Terra data at the Hawaii Ocean Time 
Series (HOTS), Bermuda Atlantic Time-series Study (BATS), and Hawaii’s Marine Optical Buoy (MOBY) 
sites are now being outputted for future data comparisons. Values within 5% are expected. 

Unlike accuracy estimations for IPAR, assessing the accuracy of MODIS-derived ARP values is not directly 
achievable. This is because field measurements of both a(L,) and IPAR are typically not available for testing 
algorithm performance. Indirect evaluations of ARP have been made, however, based on ship radiometry of 
ARP components. If accuracies of ~7% are attributed to IPAR, -30% to aph(X) (Carder etal, 1999), and 10% 
to diffuse attenuation, then the root-mean sum of squares of the error sources provides an error estimate for 
ARP of -32%. 

Since, chlorophyll-a concentration is the most widely used product derived from ocean-color data, the rest of 
this document focuses on the algorithm development, updates, and validation efforts of the semi-analytically 
derived MODIS ocean data product, Chlor_a_3. Chlor_a_3 retrievals from field water-leaving radiance data 
have successfully been validated to within about 25%. Thus, the 35% accuracy goal established by the ocean 
color community, based upon a goal of 5% accuracy for normalized water-leaving radiance values, has been 
achieved. Accuracies of -30% for aph(675) and -40% for ag(400) were calculated previously by Carder et al. 
(1999). Here, we use data accuracies for Chlor_a_3 as a proxy for ap h (675) and ag(400) accuracies, since field 
measurements of ap h (675) and ag(400) are scarce. 

The following Algorithm Theoretical Basis Document (ATBD- 19) is considered a so-called “living” 
document because it tracks the changes that have occurred to the semi-analytical algorithm since the contract 
began. 


The algorithm was first developed (Section 3) and evaluated (Section 4) using tropical/subtropical and 
summer temperate field data. Algorithm parameters for three bio-optical domains (unpackaged, global, and 
packaged) were developed to contend with the ten- fold variability observed globally in chlorophyll-specific 
phytoplankton absorption coefficients. A strategy for transitioning between these bio-optical domains based 
on comparisons between satellite-retrieved sea surface temperatures and nitrate depletion temperatures (NDT) 
was introduced and tested in Section 5. Algorithm errors of more than 45% were reduced to less than 30% 
with this approach. 

Access to data deposited by the ocean color community into the SeaWiFS Bio-optical Archive and Storage 
System (SeaBASS) later led to the expansion of bio-optical domains into high-latitude, polar regions (Section 
6). A set of “fully-packaged” algorithm parameters was developed in this section along with an updated 
NDT-based blending strategy to smoothly transition between the “old” unpackaged and the “new” fully- 
packaged pigment domains. Chlorophyll concentrations underestimated by as much as factor of two using 
empirical algorithms tuned to tropical, subtropical, and temperate waters could now be derived more 
accurately using this semi-analytical approach. 

With the launch of MODIS Terra in December 1999 and the release of Collection 4 ocean color data products 
in June 2002, validation of MODIS radiances and radiance-derived chlorophyll-a concentrations was 
permitted. Match-up comparisons have recently been made between both MODIS-derived semi-analytical 
(Chlor_a_3) and empirical (Chlor_a_2) chlorophyll-a concentrations and field data using reprocessed “004” 
(Section 7) and reprocessed “041” (Section 8) MODIS Terra data. 

Using MODIS Terra (Collection 4, reprocessing 041) normalized water-leaving radiance data, field 
chlorophyll-a concentrations from the SeaBASS data archive were compared to the Chlor_a_2 and Chlor_a_3 
data products. The chlorophyll algorithms were assessed first using field radiance values and then using 
satellite radiance values. Chlor_a_2 and Chlor_a_3 retrieval errors using field radiance data were 52% and 
25%, respectively. The retrieval errors using MODIS Terra radiance values increased to 56% and 48%, 
respectively. 

Error simulations indicate that the increased Chlor_a_3 error observed when switching from field to satellite 
radiance data can partially be explained by errors observed in retrieving normalized water-leaving radiances 
for each of the 412, 443, and 55 1 nm wavebands. In order for MODIS radiance derived Chlor_a_3 values to 
be accurate within the 35% accuracy goal, normalized water-leaving radiance error levels less than 14% are 
required from each waveband. 

In addition, MODIS retrievals of Chlor_a_3 may improve with a) the addition of more open-ocean match-up 
data, b) a better filter for flagging and possibly correcting absorbing aerosols under coastal and/or dust-rich 
atmospheres, and c) improved polarization corrections. Regarding match-up imagery, changing 3-by-3 pixel 
medians at scene edges where chlorophyll-a concentrations are not patchy to 5-by-5 pixel means may reduce 
detector-striping and side-banding effects that can decrease algorithm accuracies. 

The algorithm development and validation work using field data (Sections 3-5) has been peer-reviewed and 
published in Carder et al. (1999). Similarly, the expansion of bio-optical domains into high latitude regions 
along with the initial match-up analysis using MODIS Terra reprocessed 004 data (Sections 6 -7) has also 
been peer-reviewed and was recently published in Carder et al. (2004). 

In Section 1 0, we list the “lessons learned” throughout the course of this contract. Primarily, we learned the 
following: 1 ) overestimations in empirically-derived chlorophyll-a concentrations (e.g. Chlor_a_2) caused by 
CDOM can be avoided using a semi-analytical approach capable of separating the absorption spectra of 



CDOM and phytoplankton and 2) underestimations in Chlor_a_2 by a factor of two compared to field and 
Chlor_a_3 data can occur in upwelling and high-latitude regions if low chlorophyll-specific absorption 
coefficients found in this region are not considered. 

One final note, since errors associated with the empirical Chlor_a_2 algorithm when applied to field radiances 
are significantly greater than 35%, no amount of error reduction in satellite-derived, normalized water-leaving 
radiances will reduce Chlor_a_2 retrievals to 35%. Thus, it is strongly urged that the ocean color community 
continue to generate a MODIS Chlor_a_3 data product and reprocess SeaWiFS data (1997-present) using 
AVHRR sea surface temperature data and the Chlor_a_3 algorithm in order to achieve a long-term climate- 
quality global data set for ocean chlorophyll-a and derived products such as primary productivity, 
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1.0 Introduction 

The Case 2 chlorophyll a algorithm is based on a semi-analytical, bio-optical model of remote- 
sensing reflectance, R :s (7*), where R re (/.) is defined as the water-leaving radiance, Lw(X), divided by the 
downwelling irradiance just above the sea surface, Ea(X,0 + ). The R re (X) model (Section 3) has two free 
variables, the absorption coefficient due to phytoplankton at 675 nm, a ? (675), and the absorption 
coefficient due to colored dissolved organic matter (CDOM) or gelbstoff at 400 nm, ag(400). The R re 
model has several parameters that are fixed or can be specified based on the region and season of the 
MODIS scene. These control the spectral shapes of the optical constituents of the model. R re (X,,) values 
from the MODIS data processing system are placed into the model, the model is inverted, and a 9 (675), 
ag(400) (MOD24), and chlorophyll a (MOD21, Chlor_a_3) are computed. 

The algorithm also outputs both the total absorption coefficients, a(^) (MOD36), and permits 
calculation of the phytoplankton absorption coefficients, a^) (MOD36), based on 0,(675) retrievals. 
These are used in the calculation of the absorbed radiation by phytoplankton (MOD22) for use in the 
calculation of chlorophyll fluorescence efficiencies (MOD20). MOD22 is also dependent upon the 
instantaneous photosynthetically available radiation (MOD22), which is discussed in ATBD20. 

Algorithm development is initially focused on tropical, subtropical, and summer temperate 
environments, and the model is parameterized in Section 4 for three different bio-optical domains: (1) 
high ratios of photoprotective pigments to chlorophyll and low self-shading, which for brevity, we 
designate as “unpackaged"; (2) low ratios and high self-shading, which we designate as “packaged”; and 
(3) a transitional or global-average type [Carder et al., 1999]. These domains can be identified from space 
by comparing sea-surface temperature to nitrogen-depletion temperatures for each domain (Section 5). 
Algorithm errors of more than 45% are reduced to errors of less than 30% with this approach, with the 
greatest effect occurring at the eastern and polar boundaries of the basins. Section 6 provides an 
expansion of bio-optical domains into high-latitude waters. The “fully packaged” pigment domain is 
introduced in this section along with a revised strategy for implementing these variable packaging 
domains [Carder et al., 2004]. Chlor_a_3 values derived semi-analytically and Chlor_a_2 values derived 
empirically using the O’Reilly et al. (2000) OC3M algorithm from MODIS Terra radiances are compared 
to field chlorophyll-a concentrations in Sections 7 and 8. 

2.0 Overview and Background Information 

According to the optical classification by Morel and Prieur (1977], oceanic waters may be 
characterized as Case 1, in which the optical properties are dominated by chlorophyll and covarying 
detrital pigments, or as Case 2, in which other substances, which do not covary with chlorophyll, also 
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affect the optical properties. Such substances include gelbstoff, suspended sediments, coccolithophores, 
detritus, and bacteria. Pigment retrievals from Coastal Zone Color Scanner (CZCS) data in Case 1 waters 
have achieved reasonable results (± 40% for local best cases [Gordon et ah , 1983]). However, substances 
not covarying with chlorophyll in Case 2 waters have caused the retrieval of pigment concentrations to 
have inaccuracies as high as 133% [Carder et ah , 1991], 

CDOM absorbs light in an exponentially decreasing manner as a function of wavelength. 
Pheopigments, detritus, and bacteria similarly absorb more strongly at 412 rnn than they do at 443 nm. 
Phytoplankton, on the other hand, absorb more strongly at 443 nm than at 412 nm. Thus, by measuring 
the relative amounts of light leaving the sea surface at those two wavelengths, we can estimate the relative 
amounts of phytoplankton and the other absorbing products mentioned above. 

Winter convective over turn of the upper ocean layer also mixes up gelbstoff-rich deeper waters 
that have not been photo bleached unlike summer surface waters. These waters appear in CZCS data as 
being more chlorophyll-rich than measurements and models indicate [Siegel and Michaels , 1996]. 

The remote sensing reflectance, R^, model used to develop the algorithm presented here has a few 
parameters that cannot be fixed and applied to the entire globe; that is, they are site- and season-specific. 
This is due to the inherent variability of many bio-optical constituents. For example, absorption at 440 
nm per unit chlorophyll a by phytoplankton can change with species and with nutrient and lighting 
conditions by more than an order of magnitude [Morel and Bricaud , 1981; Kirk , 1983; Carder et ah , 

1991; Morel et ah , 1993; Bricaud et ah , 1995]. In addition, particle size and concentration have 
significant effects on the spectral backscattering coefficient bb(A) of ocean water: pure water backscatter 
varies as ~ X 4 , large-particle backscatter varies as ~ X°, and backscatter by smaller-diameter detritus and 
bacteria varies with a spectral dependence between the two extremes [Morel and Ahn, 1990, 1991]. 

Many of these factors covary, allowing the simple wavelength-ratio algorithms of the CZCS 
[Gordon and Morel , 1983] to work fairly well. We have tried to understand many of these individual 
covariances and have developed empirical expressions for several individual bio-optical variables. By 
analyzing individual components of the model, we can gain a deeper understanding of the processes 
affecting the color of water-leaving radiance. 

To the extent that such covariances change with season or bio-optical domains [e.g., see Prieur 
and Sathyendranath, 1981; Sathyendranath et ah , 1989; Mitchell and Holm-Hansen, 1991], we must 
consider temporal and spatial changes in algorithm parameterization. A strategy for partitioning the ocean 
into at least three different bio-optical domains on the basis of nutrient-temperature relationships, each 
with different model parameters, is discussed in Section 5. Section 6 provides an expansion of bio-optical 
domains from mainly tropical, subtopical, and summer temperate environments into high-latitude waters. 
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2.1 Experimental Objective 

The main data product is chlorophyll a concentration, [chi a], which can be used as an indicator 
of plankton biomass, as an input to primary production models (MOD27), or to trace oceanographic 
currents, jets, and plumes. Other output products are a^(675), ag(400), a^), and a(Xi). Total absorption, 
a(Xj), is necessary for calculating the light absorbed with depth for heat budget models. a,(kj) is used in 
the IPAR/ARP MODIS algorithm (MOD22). ag(400) by itself can be used to map river plumes, to 
determine diffuse attenuation at that wavelength, or to calculate dissolved organic carbon (DOC) standing 
stocks and fluxes. In order to calculate DOC, we need to know how DOC concentration is related to 
DOC absorption. As coastal, estuarine, and other Case 2 environments become increasingly recognized 
as important areas of study, algorithms that can deal with the complex bio-optical properties of these 
regions will become available. 

2.2 Historical Perspective 

CZCS algorithms for estimating [chi a ] plus pheophytin a concentrations perform quite well for 
regions of the ocean where scattering and absorbing components of seawater covary with these pigments, 
i.e., in Case 1 waters [Gordon and Morel, 1983; Gordon et al., 1983]. A number of empirical and semi- 
analytical optical models have been developed to simulate the behavior of the underwater light field for 
such waters [Morel and Prieur, 1977; Baker and Smith, 1982; Gordon et al., 1988; Morel, 1988 , Mitchell 
and Holm-Hansen, 1991, O’Reilly et al., 1998]. Such models have been used as the basis for classifying 
water types and/or for developing remote sensing algorithms. 

However, the accuracies of these models decrease when environmental conditions depart from 
those representative of the data set used to empirically derive the covariance relationships. For instance, 
CDOM is produced when grazing, photolysis, and other mechanisms degrade the viable plant matter at 
and downstream from phytoplankton blooms. The CDOM-to-chlorophyll ratio will change dramatically 
for a parcel of upwelled water over a relatively short time, from chlorophyll-rich and CDOM-poor during 
a bloom to CDOM-rich and chlorophyll-poor after being grazed down. Solid evidence for the occurrence 
of this scenario can be found in two separate studies. Peacock et al., [1988] found that absorption 
attributed to CDOM at 440 nm was at least 16 fold that due to phytoplankton pigments within an offshore 
jet from an upwelling region, whereas pigments were the dominant absorption agents at the upwelling 
center near the coast. Similarly, Carder et al., [1989] found that particulate absorption at 440 nm 
decreased 13 fold while CDOM absorption at 440 nm increased by 60% in ten days for a phytoplankton 
bloom tracked from the Mississippi River plume to Cape San Bias. This widely varying CDOM-to- 
chlorophyll ratio has a profound effect on upwelled radiance in the blue 443 nm band of the CZCS, and a 
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smaller but still significant effect in the green 520 nm band. For CDOM-rich waters, the correspondence 
in absorption at 443 nm and 520 nm between CDOM and chlorophyll creates erroneously high estimates 
of pigment concentration in those models which rely solely upon either of these spectral bands to indicate 
absorption due to phytoplankton. 

Carder et aL, (1991) proposed that a short wavelength channel at around 410 nm could be used to 
distinguish CDOM (and other degradation products) from chlorophyll. Data for a channel at 412 nm is 
available not only for MODIS, but also for Ocean Color and Temperature Scanner (OCTS) and Sea- 
Viewing-Wide-Field-Sensor (SeaWiFS) imagery. 

2.3 Instrument Characteristics 

The algorithm requires input of the water-leaving radiance, L w , at the MODIS ocean wave bands 
centered at 412, 443, 488, and 551 nm, respectively, and bio-optical domains are designated based upon 
sea-surface temperature (Section 5), a derived product of MODIS. Rrs is derived from the water-leaving 
radiance L w [ Gordon and Wang, 1994], as = Lw(F 0 cos 0 1^)* 1 , where F 0 is the extraterrestrial solar 
irradiance, 0 is the solar zenith angle, and td is the transmittance of light across the air-sea interface. The 
1000 m resolution, new spectral bands, and near-daily coverage of MODIS will allow the observation of 
mesoscale oceanographic features in coastal and estuarine environments, areas seen to be increasingly 
important in many marine science studies in addition to traditional open-ocean observations. 

3.0 Algorithm Description 

Morel and Gordon [1980] describe three approaches to interpret ocean color data in terms of the 
in situ optical constituents: empirical, semi-empirical, and analytical. In the analytical approach, radiative 
transfer theory provides a relationship between upwelling irradiance or radiance and the in situ inherent 
optical properties backscattering and absorption [e.g., Sathyendranath and Platt, 1997]. Then, constituent 
concentrations are derived from irradiance or radiance values measured at several wavelengths by 
inversion of the resultant system of equations. The MODIS algorithm uses this approach, with the term 
"semi-analytical" invoked because bio-optical pieces of the radiative model are expressed by empirical 
relationships. The algorithms developed herein have been peer-reviewed and appear in condensed form in 
Carder et aL, [1999] and Carder et aL, [2004]. 

3.1 Theoretical Description 

3.1.1 Physics of Problem 
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After light enters the ocean, some of it eventually scatters back up through the surface. This light 
is called the water-leaving radiance L w (>.), and it can be deduced from space after removal of atmospheric 
effects. The magnitude, spectral variation, and angular distribution of this radiance depend on the 
following factors: the absorption and backscattering coefficients of the seawater, a (X) and bb(A.), 
respectively (known as the inherent optical properties); the downwelling irradiance incident on the sea 
surface E d (A,0~); and the angular distribution of the light within the ocean. To make things easier, we 
divide seawater into three components, each one having distinct optical properties of its own. These 
components are the seawater itself (water and salts), the particle fraction, and the dissolved fraction. 
Fortunately, a(k) is simply equal to the sum of the absorption coefficients for each component, and b b (>^) 
is equal to the sum of the backscattering coefficients. If we can accurately describe or model each 
spectrally distinct component of the absorption and backscattering coefficients, then we can determine the 
magnitude of each one from measurements of L W (A.) and E d (0 + ,A.), given some assumptions about the 
angular distribution of light in the water. The key here is to accurately model the spectral behavior of a(/,) 
for each component. The spectral behavior of bt,(A) is not as dynamic. 

3.1.2 Mathematical Description of Algorithm 
3.1. 2.1 R„ Model 

The R re model is given by the following general equation, which is adapted from Lee et al., 

[1994]: 


p (7)= LA bbC^i rn 

Q(A)„ 2 [aa) + b b (V ] 

where f is an empirical factor averaging about 0.32-0.33 [Gordon et al., 1975; Morel and Prieur, 1977; 
Jerome et al., 1988; Kirk, 1991], t is the transmittance of the air-sea interface, Q(/.) is the upwelling 
irradiance-to-radiance ratio Eu(X.)/L u (X), and n is the real part of the index of refraction of seawater. By 
making three approximations, Eq. 1 can be greatly simplified. 

1) In general, f is a function of the solar zenith angle, 0 O [Kirk, 1984; Jerome et al., 1988; Morel 
and Gentili, 1991]. However, Morel and Gentili [1993] have shown that the ratio f/Q is relatively 
independent of 0 O for sun and satellite viewing angles expected for the MODIS orbit. They estimate that 
f/Q = 0.0936, 0.0944, 0.0929, and 0.0881, (standard deviation ± 0.005), for X = 440, 500, 565, and 665 
nm, respectively. Also, Gordon et al., [1988] estimates that f/Q = 0.0949, at least for 0o > 20°. Thus, we 
assume that f/Q is independent of X and 0 O for all MODIS wavebands of interest, except perhaps for the 
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band centered at 667 nm, which we don’t use. 

2) t 2 /n 2 is approximately equal to 0.54, and although it can change with sea-state (Austin, 1974), 
it is relatively independent of wavelength. 

3) Many studies have confirmed that b b (/,) is usually much smaller than a(X) and can thus be 
safely removed from the denominator of Eq. 1 [Morel and Prieur, 1977; references cited in Gordon and 
Morel, 1 983], except for highly turbid waters. 

These three approximations lead to a simplified version of Eq. 1, 

R„(^)~ constant (2) 

” a(X) 


where the "constant" is unchanging with respect to X and 0o. The value of the constant is not relevant to 
the algorithm since, as will be shown later, the algorithm uses spectral ratios of R re (X) and the constant 
term factors out. 

In the following sections, both bb(X) and a(X) will be divided into several separate terms. Each 
term will be described empirically. The equations are written in a general fashion — i.e., the empirically 
derived parameters that describe each term are written as variables — and the actual values of the 
parameters that are used in the algorithm are shown in Tables la and lb. 


3.1.2.2 Backscattering Term 

The total backscattering coefficient, bj,(X), can be expanded as 


b b (X)~bbw(X) + bbp(X) 


( 3 ) 


where the subscripts "w" and "p" refer to water and particles, respectively. b bw (X) is constant and well 
known [Smith and Baker, 1981]. bb P (X) is modeled as 


b„ P (X) = X 



(4) 


The magnitude of particle backscattering is indicated by X, which is equal to bb P (551), while Y describes 
the spectral shape of the particle backscattering. 

Lee et al . , [1994] empirically determined X and Y values by model inversion using a formula 
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similar to (4). The X and Y values were compared to the R re (>i) values measured at each station with the 
purpose of finding empirical relationships for both X and Y as a function of R rs (X) at one or more of the 
MODIS wavelengths. Once this was done, X and Y could be estimated from MODIS data. These 
empirical relationships are described below. 


Table la. Wavelength-dependent parameters for the semi-analytical chlorophyll algorithm for regions 
without packaged pigments. 


X 

412 

443 

488 

551 

bbw (m' 1 ) 

0.003341 

0.002406 

0.001563 

0.000929 

aw (m~‘) 

0.00480 

0.00742 

0.01632 

0.05910 

a<> 

2.20 

3.59 

2.27 

0.42 

1 &1 

0.75 

0.80 

0.59 

-0.22 

I 

-0.5 

-0.5 

-0.5 

-0.5 

1 a3 

0.0112 

0.0112 

0.0112 

0.0112 


Table lb. Wavelength-independent parameters for the semi-analytical chlorophyll algorithm for 
regions without packaged pigments. 

wavelength independent parameters 8 


Xo 

-0.00182 


S 

0.0225 


Co 

0.2818 

X, 

2.058 

Po 

51.9 

Cl 

-2.783 

Y 0 

-1.13 

Pi 

1.00 

C 2 

1.863 

Y, 

2.57 


C 3 

- 2.387 


3.1.2.2.1 Expression for X 

The general expression for X is 


X = RJ551) (5) 

where Xo and are empirically derived constants. Linear regression performed on the derived values of 
X vs. Rrs(551) taken from four cruises to the Gulf of Mexico (CP92, Tambax 2, GOMEX, and COLOR) 
resulted in Xo and Xi values of -0.00182 and 2.058 (n = 53, r 2 = 0.96). Figure 1 shows the regression 
graphically. If X is determined to be negative from Eq. 5 it is set to zero. 
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Figure 1. X versus R^(551), where X is the magnitude of particle 
backscattering and is the remote sensing reflectance at 551nm. 
The line is the linear regression equation X— 0.00182 + 2.058 
RJ551) (n=53, ^=0.96). 


The values of Xo and Xi that are used in this version of the Case 2 chlorophyll algorithm are 
probably adequate for most of the globe and they are listed in Table 1. For regions influenced by river 
outflows, these parameters should be determined on a site-specific basis. 


3.1.2.2.2 Expression for Y 

Y was found to covary in a rather general way with the ratio R„(443)/R rs (488). Variations in 
numerator and denominator values of this ratio are largely determined by absorption due to phytoplankton 
and CDOM. Absorption due to water is about the same and low at both wavelengths. Thus, to the extent 
that phytoplankton and CDOM absorption covary, the spectral ratio of the absorption coefficients, 
a(443)/a(488), will be only weakly dependent on pigment concentration, and the spectral ratio of 
backscattering coefficients should have a significant effect on the spectral ratio of R^. Y is thus 
represented as 


Y=Yo 


RJ443) 
' RJ488) 


( 6 ) 


a linear function of ^(443)^^(488) where Y 0 and Yi are empirically derived constants. 

Accurate measurements of s^(X) and accurate removal of reflected skylight from the R re 
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measurements are critical in determining Y by model inversion. Only data from the GOMEX and 
COLOR cruises are used here because the a g (^) values were determined with a long-path 
spectrophotometer [Peacock et al t 1994]. Linear regression of Y versus R rs (443)/R rs (488) for stations 
from these two cruises resulted in Y 0 and Yi values of -1.13 and 2.57 (n = 22, r 2 = 0.59). Figure 2 shows 

3.0 
2.5 

2.0 
v 1.5 

1.0 
0.5 
0.0 

0.4 0.6 0.8 1.0 1.2 1.4 1.6 

R„(*43)/^(4B8) 

Figure 2. Spectral shape of particle backscattering Y versus 
R IS (443)/R«(488). The line is the linear regression Y = - 1 . 1 3 + 

2.57 ^(443)^(488) (n=22, r*=0.59). 

the regression graphically. If Y is determined to be negative from Eq. 6 it is set to zero. A number of 
other spectral ratios of R^X) were tested, but the 443:488 ratio had the highest correlation with Y. 

The Y parameter should be large when the backscattering is due to small particles and/or water 
and vice versa [ Gordon and Morel, 1983]. In oligotrophic regions we have determined values of Y 
greater than 2, while in waters with [chi a] > 10 mg m" 3 the estimated Y values are often ~ 0. Where 
gelbstoff concentrations are high, nutrients are typically high, and pigments are more packaged. Larger 
particles and lower Y values are expected to occur here, even with a lack of covariation between pigment 
and gelbstoff absorption. 

3,1.23 Absorption Term 

The total absorption coefficient can be expanded as 

a(A) = a w ( A)+a t f,(A)+ad(A)+ag(J-) (7) 
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where the subscripts "w", "<p," "d," and "g" refer to water, phytoplankton, detritus, and CDOM ("g" stands 
for gelbstoff). Here a w (l) is taken from Pope and Fry, [1997], Expressions for a^(A.), a^X), and ag(X) 
need to be developed. 

3.1. 2.3.1 Expression for a. 

The shape of the a^(X) spectrum for a given water mass will change due to the pigment-package 
effect (i.e., the flattening of absorption peaks due to self-shading with increasing intracellular pigment 
concentration and larger cell size) [Morel and Bricaud, 1981] and changes in pigment composition. For a 
given domain, normalizing measured a^f/,) curves to a^(675) reduces the dynamic range and results in a 
smooth variation for a^(k)/a S) (675) vs. a^(675) for the MODIS wavebands centered at \ = 412, 443, 488, 
and 55 1 nm (see Figure 3) data for two high-light, subtropical regimes. To the extent that [chi a ] is 
proportional to a<p(675) (e.g. see Figure 4), Figure 3 demonstrates how the chlorophyll-specific absorption 
changes with chlorophyll concentration. 

A hyperbolic tangent function was chosen to model this relationship in order to ensure that the 
value of a 9 (X)/a ? (675) approaches an asymptote at very high or very low values of ^(675). Carder et al., 
[1991] detail the behavior of this function with parameterization, although we have substituted a^,(675) for 
the [Chi a] found in their expression. Using logarithmic scaling for both axes results in the following 
model equation for a^(k) as a function of a,(675), 

a/A) = ao(A)exp[a,(A)tonh[a 2 (AMat(675)/ a3 (A))]]*at(675) ( 8 ) 

where the parameters ao(X) to a 3 (X) are empirically determined for each MODIS wavelength of interest. 
ao(^) is the most important of these parameters, as it is directly proportional to ^(X). For simplicity, only 
ao(X) and &\(X) are varied to parameterize ^(X) 9 with 3l 2 (X) and a 3 (X) being set to the constant values of 
-0.5 and 0.01 12, respectively. Figure 3 shows the measured data and the modeled curves for a^(X) 
measurements taken from the GOMEX, COLOR, and TN048 cruises all considered to be part of the same 
high-light, subtropical domain (TN048 was an expedition to the Arabian Sea during monsoon conditions). 
The parameters ao(^) to a 3 (k) are listed in Table la. 

3.1.2.3.2 Expression for ^ and 

The ^(X) and ag(X) can both be fit to a curve of the form a x (X) = a x (400) exp[-S x (?t-400)] where 
the subscript "x" refers to either M d” or ”g" [Bricaud et al, 1981; Roesler et aL, 1989; Carder et al. , 

1991]. Owing to this similarity in spectral shape, these terms cannot be spectrally separated with the 
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Figure 3. Absorption coefficients e^(X)/d^(615) versus a^(675) for each Moderate- 
Resolution Imaging Spectrometer (MODIS) ocean wave band of interest The number 
at top right comer indicates wavelength, X. The lines are described by equation (8) using 
the parameters listed in T able 1 a, and they represent the minimum sum-of-squares errors 
for modeled versus measured values of a 9 (X)/a, p (675). 

MODIS channels, so the a<j(X) term is combined operationally with a g (X), and both detrital and CDOM 
absorption are represented by ag(X). The combined CDOM and detritus absorption term is thus written 

a g (A) = ag (400) exp'*^ (9) 

where S is empirically determined. Many researchers have reported that Sd = 0.01 1 nm' 1 , on average 
[Roesler et al, 1989]. For the GOMEX and COLOR cruises, an average value of 0.017 nm" 1 was 
measured for S g . Values reported by F. Hoge (personal communication) for the Sargasso Sea were 
somewhat higher as are those found near swampy regions of the Gulf of Mexico. The algorithm 
performance was optimized by varying S g , with the value 0.017 nm' 1 providing the smallest residual error 
compared to field measurements (Table lb). The increase in S is thought to account in part for the lack of 
gelbstoff fluorescene in the algorithm, which increases Lw(412) [e.g. see Mobley , 1994]. 

As a final note on the Rrs model, Eqs. 5-9 are written in a general way to emphasize that the 
values of the parameters Xo, Xi, Y 0 , Y 1? ao, a b and S are not meant to be absolute. They should be 
updated and changed as more data become available. These parameters may also be changed with region 
and season to optimize algorithm performance. 


ll 






3.1. 2.4 Inverting the Model 

All of the pieces of the reflectance model are now in place. Via Eqs. 2-4, and 5-9, R rs (X) can be 
expressed solely as a function of the "constant" term, 11^(443), R re (488), R re (551), a,(675), and ag(400), 
given values for the parameters for Xo, X], Y 0 , Yj, ao(X), ai(X), and S. Lwn(X) from MODIS can be 
converted into R re (X) as mentioned previously. Then, for each pixel, the R re model equation can be 
written for each of the 5 available MODIS wavebands yielding five equations written in three unknowns: 
the "constant" term, a,(675), and ag(400). 

Using spectral ratios of R re eliminates the "constant" term, since it is largely independent of 
wavelength. In principle, two spectral ratio equations can be used to solve for the two remaining 
unknowns, a,(675) and ag(400). Based on the shape of the absorption curve for phytoplankton versus 
those for CDOM and detritus, equations using spectral ratios of 412:443 and 443:551 for R„(X.) should 
provide a good separation of the two absorption contributions. Our two equations are 

Rn(412) __ bb(412) a(443) 

RJ443) bb (443) a(412) 

( 10 ) 

Rrs(443) _ bb(443) a(551) 

RJ551) bb(551) a(443) 


The right-hand side of each equation is a function of a,(675), ag(400), Rrs(443), Rrs(488) and R re (551). 
Since the R re values are provided on input, we now have two equations in two unknowns. The equations 
can usually be solved algebraically to provide values for a,(675) and ag(400). The computational method 
of solving these equations is described in Section 3.2. 

For waters with high concentrations of CDOM and chlorophyll, Lw(412) and L w (443) values are 
small, and the semi-analytical algorithm cannot perform properly. It is thus designed to return values 
only when modeled a,(675) is less than 0.03 m' 1 , which is equivalent to [chi a] of about 1. 5-2.0 mg m' 3 . 
Otherwise, an empirical algorithm for [chi a] is used, which is described in Section 3. 1.2.6. 

3.1. 2.5 Pigment Algorithm for Semi-analytical Case 

When the semi-analytical algorithm returns a value for <3,(675), [chi a] is determined via a direct 
relationship to this value. This step requires knowledge of the chlorophyll-specific absorption coefficient 
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for phytoplankton at 675 nm, <3,*(675), for the bio-optical domain involved. Bricaud et al. [1995] 
demonstrated a wide range of values for a <,*(675) using a global data set. If only surface values for waters 
in a more limited bio-optical domain (e.g., tropical and subtropical waters) are examined, however, this 
variability is greatly reduced. Phytoplankton found in high-light environments, for example, have 
relatively low concentrations of light-harvesting accessory pigments and relatively high concentrations of 
photoprotective pigments compared to plants found in samples from high latitudes, upwelling centers, or 
deep in the euphotic zone. Therefore the effects of accessory pigment absorption on the variability of the 
largely chlorophyll a dominated red peak at 675 nm are small in high-light environments. Furthermore, 
photoprotective pigments do not absorb light at 675 nm, and so they do not affect 0,(675), even if they 
are present in large quantities. 

To evaluate variations of o,( 675) with [chi a] for subtropical to tropical waters, we developed a 
data set to explore the more limited variation in surface values of a,*(675) under high-light conditions 
(see section 4.2.1 for methodology). This data set came from surface-water samples from several cruises 
in the Gulf of Mexico (BONG 1, BONG 2, BOSS 1, and WFS) and one cruise to the Arabian Sea 
(TN048). Linear regression of log([chl a]) versus log[a,(675)] yielded an equation of the form 

[chla] = p 0 *[ a /675)] p ' (11) 


For the data set mentioned above, the regression resulted in po and p\ values of 56.8 and 1.03, 
respectively ( n = 95, 0.97 on the log-transformed values). This regression and the data are shown in 

Figure 4. Within a given bio-optical domain, we find only a very weak change in a,’(675) with [chi a\. 
The exponent is close enough to 1 .0 that little error occurs by linearizing the parameter values to 5 1 .9 and 
1.00, respectively, for that domain. This suggests an average a,*(675) value of 0.0193 m 2 (mg chi)' 1 for 
subtropical data sets. 

3. 1.2.6 Pigment Algorithm for the Default Case 

When the semi-analytical algorithm does not return a value for a,(675), we provide an empirical, 
two-wavelength algorithm for [chi a] to use by default. Aiken et al., [1995] found that the 
L w (488)/Lw(55 1) ratio is best for empirical [chi a] determination. We use an equation of the form 

log [chi a] emp = co + c, logf rss) + o flog( r 35 )] 2 + Cj [\og( r35 )f (12) 
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Figure 4. The [chi a] versus a^(675) in (a) logarithmic scaling and (b) normal scaling. In 
both charts the dashed line is the equation [chi a ] = 56.8 [a^(675)] 103 , which is the result of 
linear regression on the log-transformed values (n=96, r 2 =0.97). 


where 


Rrs(488) 

R„(551) 


(13) 


[chl is called the "empirically-derived" or "default" chlorophyll a concentration, and Co, Ci, C 2 , and C 3 
are empirically derived constants. 

A subtropical and temperate summer data set was constructed from stations from the MLML 2, 
GOMEX, COLOR, TN042 and TN048 cruises, and from stations below 45 °N from the TT010 cruise 
(Table 2). This data set includes both open-ocean and riverine-influenced stations. A cubic regression of 
log([chl a ]) against log(r 35 ) for measured [chl a] and R re (A.) in this data set resulted in values of c 0 = 
0.2818, Ci = -2.783, C 2 = 1.863, and c 3 = -2.387. The data and the regression line are shown in Figure 5. 
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Table 2. List of cruises with optical and bio-optical data collected by the University of South Florida (Carder data set) for initial tests 


of the unpackaged algorithm. 


cruise 

start date 

end date 

region 

# stations | 

MLML2 

13 Aug 91 

29 Aug 91 

North Atlantic, 42°N-60°N 

7 

TT010 

20 Jul 92 

02 Aug 92 

North Pacific, 24°N-48°N 

10 

GOMEX 

10 Apr 93 

19 Apr 93 

Northern Gulf of Mexico 

21 

COLOR 

31 May 93 

09 Jun 93 

Northern Gulf of Mexico 

13 

TN042 

29 Nov 94 

18 Dec 94 

Arabian Sea 

12 | 

TN048 

21 Jun 95 

13 Jul 95 

Arabian Sea 

41 | 


Total number of stations is 104 



Figure 5. The [chi a] versus R IS (488)/R re (55 1 ) in log-log scaling. The 
dashed line represents a cubic regression on the log-transformed values 
and describes the default [chi a] algorithm. 

3.1.2.7 Weighted Pigment Algorithm 

Another consideration is that there should be a smooth transition in [chi a] values when the 
algorithm switches from the semi-analytical to the empirical method. This is achieved by using a 
weighted average of the [chi a] values returned by the two algorithms when near the transition border. 
When the semi-analytical algorithm returns an ^(675) value between 0.015 and 0.03 m‘ l , [chi a] is 
calculated as 
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(14) 


[chi a] = w [chi a ] sa + (1 - w)[chl a] emp 

where [chi is the semi-analytically-derived value and [chi n] emp is the empirically derived value, and 
the weighting factor is w = [0.03-a<p(675)]/0.015. 

3.1.2.8 Phytoplankton and CDOM Absorption Algorithms for Default Case 

When the semi-analytical algorithm does not return a value for a,(675), we provide empirical, 
multi-wavelength algorithms for a^(675) and ^(400) based on a^,(440) and ag(440) [Lee et al., 1998]. 
Using these results, the empirical, default algorithms for high a^,(675) values were determined by 
adjusting Lee’s results to 675 nm for phytoplankton and to 400 nm for gelbstoff, 

a/675 ) emp = 0.328 * [ 10 -°M+> 037p 2r 0.407ti r 3.531p is H.702p] 5 _ Q QQgJ (15) 

and for high ag(400) values, 

a g (400 ) emp = i,5*[iQ-‘ I47 -‘- 963 P,r>oip] s ^s56p a+ iJ02p^ (jg) 


where py are log of the ratio of the remote sensing reflectance of MODIS channel i to channel j. 

Again, a weighted absorption algorithm similar to Equation 14 is used for each of these 
components to transition from the semi-analytical expression to the default expression. 

The precision for the total absorption coefficients calculated empirically by Lee et al., [1998] 
determined by the goodness of fit were 15.3 %, and for pigment absorption coefficients at 440 nm it was 
29.1%. Transferring to 675 nm and considering global data sets will likely increase this uncertainty to 
perhaps 35% to 40%. The uncertainty of default ag(400) values is expected to be between 35% and 50%, 
although as with the semi-analytical values, there is a paucity of data sets to firm up these estimates. 

3. 1.2.9 Total and Phytoplankton Absorption Coefficients 

The phytoplankton absorption coefficients a^X t ) are calculated by inserting the modeled a p (675) 
value into Eq. 8 and by using the necessary parameters from Table la for each wavelength. The total 
absorption coefficients a(ki) are calculated by inserting the modeled a g (400) value and the S parameter 
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from Table lb into Eq.9 to get a/A.,) and then combining the result with the a 9 (kj) and a w (X,) values using 
Eq. 7 and Table la. 

3.2 Numerical computation 

Here n 9 (675) and ^400) are determined from Eq. 10 with the substitutions discussed in section 
3. 1.2.4. Inverting one of the equations to isolate Ug(400) and substituting into the other equation, all terms 
are then moved to one side. This yields a function that depends only on a 9 (675) (given values for R re and 
the algorithm parameters from Tables la and lb). The value of a 9 (675) at which the function crosses zero 
is the solution we seek. This solution is determined computationally via the bisection method. A 33- 
element array of a 9 (675) values, scaled logarithmically from 0.0001 to 0.03 m" 1 is created, and the 
function is evaluated at the two extremes. If the function changes sign between the two outermost values, 
a solution exists on the <z 9 (675) interval. The function is then evaluated at the midpoint of the array, and 
the half in which the function changes sign becomes the new search interval. In this manner the solution 
interval, which will be two adjacent points on the tf p (675) array, is determined in five iterations. Linear 
interpolation across the interval yields the semi-analytical n 9 ( 675) value, and a^40Q) is determined via 
either one of the ratio equations (Eq. 10) using the modeled value of a 9 (675). If the function does not 
change sign across the two outermost values, a solution cannot be obtained and a switch is made to the 
empirical, two-wavelength, default algorithm. 

When compared to an older method (look-up tables [Carder et al , 1991]), the bisection method 
gave identical solutions to within five significant digits for a 9 (675) and ^400), and the code ran in 75% 
of the time required by the version of the code based on the look-up table. 

The algorithm code is written in American National Standards Institute (ANSI) C. The program 
file contains about 300 lines of code and comments. It was developed and tested on a DEC Alpha 
machine. It was also tested on Silicon Graphics, SUN, and PC machines. All of the algorithm parameters 
listed in Tables la and lb are read in from a file, so different parameter tables can easily be constructed 
for different applications (see below). The code is available upon request. 

4.0 Algorithm Evaluation (ATBD v.5: April 1999) 

4.1 Statistical criteria 

To evaluate algorithm performance, we generated statistics that are determined on log- 
transformed variables so as to provide equal weight to data from all parts of the pigment and reflectance 
ranges. The slope and intercept values are from type II RMA regressions. The RMS statistic described is 
based on the root-mean-square of the logarithm of the ratio of modeled-to-measured values [O Reilly et 
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al., 1998] and will be referred to here as RMS1. We also generated values for r 2 and root-mean-square 
error on the non-log-transformed (linear) data. This statistic will be referred to as RMS2 and is described 
by 


RMS2 = 


n 

i 

Xmod,/ Xobs.i 

2 

Xobs,i 


V n-2 


(17) 


where x moi , is the modeled value of the ;th element, x obs ., is the observed (or in situ or measured) value of 
the z'th element, and n is the number of elements. Note that 10 RMS1 -1.0 ~ RMS2 if there is no bias between 
the modeled and measured data. We used two graphical means of evaluating algorithm performance, 
scatterplots of modeled versus observed values and quantile-quantile plots [O’Reilly et al., 1998], 

4.2 Tests with University of South Florida Data 
4.2.1 Methods 

We initially tested our algorithm with our own subtropical and temperate-summer data set, called 
the Carder data set (Table 2), since observed values of a<p(675) and ag(400) are included wherever possible 
to accompany the observed R re (/.) and [chi a ] values. Also, 17 points from high-chlorophyll, high- 
scattering stations, mostly from the extended Mississippi River Plume region, are included. We later test 
the algorithm with global R^k) and [chi a ] data collected by international research teams and found in the 
NASA SeaBASS bio-optical data archive [Hooker et al., 1994], 

R re (412), R rs (443), Rrs(488), R re (531), and R re (551) for the Carder data set were derived from 
hyperspectral R rs (^) measurements collected just above the sea surface (for measurement protocols, see 
Lee et al., (1996)) by weighting to simulate the MODIS band responses [Barnes et al., 1994]. Most R„(a) 
measurements in the SeaBASS global database were collected from just below the sea surface following 
the protocols of Mueller and Austin [1995], Both data types are combined in algorithm performance tests 
against the global data. 

All [chi a ] values were determined fluorometrically [Holm-Hansen and Riemann, 1978; Mueller 
and Austin, 1 995], The ag(400) was determined by measuring the absorption of seawater filtered through 
a 0.2 pm pore-sized nylon filter in a spectrophotometer when compared to a MilliQ water blank [Mueller 
and Austin, 1995]. 

The method used to determine absorption coefficients for particles and for detritus involves 

filtering as much as 4 L of water through a 25 mm diameter, Whatman glass fiber filter (GFF). The 

protocols used are those discussed in Mueller and Austin [1995] and are based on methods developed by 
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Shibata [1958], Roesler et al., [1989], Mitchell [1990], Nelson and Robertson [1993], and Moore et al., 
[1995]. In order to estimate absorption coefficients from the optical density, OD, measurements, an 
optical path-elongation factor, called p, which is dependent upon OD, is employed. Recently, however, it 
has been shown that p varies with the particle size prevalent to a region [Moore et al., 1995], This 
happens because smaller particles get more deeply embedded into the filter, providing a greater 
absorption cross section for photons scattered back and forth across the particle substrate than do the large 
particles remaining at the surface of the pad. For our work we chose a P factor appropriate for small, 
subtropical particles that falls between two published P factors, one developed for detritus [Nelson and 
Robertson, 1993] and one for Synechococcus [Moore et al., 1995]. Our P factor is 

0 = 1.0 + 0.6 OU 05 (18) 

Furthermore, we loaded the filter pads enough that the pad optical density exceeded 0.04 at 675 nm 
[Bis sett et al., 1997] for more accuracy in a<p(Ji) measurements. 

4.2.2 Results 

The algorithm parameters used are shown in Tables la and lb. [chi a], a^(675), and ag(400) 
values were predicted by the semi-analytical algorithm with RMS 1 errors of 0. 1 22, 0. 1 3 1 , and 0.252, 
respectively, and with RMS2 errors of 0.289, 0.302, and 0.405, respectively (Table 4), for a largely 
subtropical or temperate-summer data set. 

The results are also shown as scatter and quantile plots (Figure 6). The crosses on the plots are the 
points determined with the semi-analytical portion of the blended algorithm, and the diamonds represent 
points determined with the default or empirical algorithm. The high-chlorophyll points extend nicely 
along the one-to-one line on both the scatter and quantile plots. The RMS1 and RMS2 errors for [chi a] 
for this composite data set (n= 1 04) were 0.132 and 0.300, respectively. 

The [chi a ] and a^(675) data appear to be quite evenly clustered about the one-to-one line on both 
scatter and quantile plots with no aberrant tails at either end. The ag(400) points are predominantly below 
the one-to-one line and show a low bias. There are only 26 points in this plot because measured values of 
ag(400) are infrequently available for comparison. These data were subtropical except for some late- 
summer, temperate data, and they had relatively large specific absorption coefficients. Thus they are 
representative of the domain we designated as unpackaged. 
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Figure 6. Algorithm performance for Carder data set, top observed versus modeled chi a, middle observed 
versus modeled ap h (675), and bottom observed versus modeled a g (400) in left scatterplots and right quantile- 
quantile plots. The lines are the one-to-one lines, SA (crosses) indicates points that are calculated 
semianalytically or by a blend of semi analytical and empirical values, while EMP (diamonds) indicates 
points that are calculated empirically. 
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4.3 Tests using a global data set 

4.3.1 Data Set 

A large (n= 919) global evaluation data set consisting of measured values at the Sea-viewing 
Wide-field-of-view Sensor (SeaWiFS) wavelengths and chlorophyll a measurements based on both 
fluorometric and high-performance liquid chromatography (HPLC) methods was archived by the NASA 
SeaWiFS Project as the SeaBAM data set [O'Reilly et al., 1998]. These data came from various 
researchers around the United States and Europe and contain mostly subsurface R r values. In addition to 
these data, we have received 36 data points from the equatorial Pacific, consisting of R re measurements 
made above the surface (EqPac, courtesy of C. Davis), and we collected additional above-water data sets 
from the Southern California Bight (SCB) (April 1997 with G. Mitchell), near Hawaii (February 1997 
with D. Clark), and the Kuroshio edge of the East China Sea (May 1997 with G. Gong), which we have 
added to the global data set. This combination of 976 data points allows for algorithm comparisons using 
a data set consisting of both above- water and below-water points. 

The recent SCB data set provided an opportunity to compare above-water R re data with the 
historical California Cooperative Oceanic Fisheries Investigations (CalCoFI) SCB subsurface R re data set 
from the SCB. We also measured phytoplankton absorption spectra in the SCB in late winter to adjust 
a curve parameters, providing a more “packaged” parameterization (Table 3) for modeling the 
multiyear CalCOFI data set of subsurface Rre values and similar eastern boundary environments. 

4.3.2 Numerical Filters 

Since many different locations and sensors were involved in compiling the SeaBASS data 
collections and as many as eight separate upwelling and downwelling sensor channels must be well 
calibrated to provide accurate spectral ratios of R n for the semi-analytical algorithm, an attempt was made 
to select an initial core set of data consistent with case 1 waters and with each other. Also, an attempt was 
made to partition the data sets into two regions, ones where little pigment packaging is to be expected 
(e.g., high-light, non-upwelling locations in warm, tropical and subtropical waters) and one where more 
packaging might be expected (e.g., eastern boundary upwelling, and non-summer high-latitude data). To 
assist in this task, each SeaBASS data set was individually examined. 

Some of the data sets were composed of data largely falling into a single type of bio-optical 
domain according to the numerical filters discussed below. Others spanned two or more domains. Data 
sets falling primarily into the domain where the pigments appeared to be relatively unpackaged with 
significant photoprotective pigments [e.g., high a<,’(443) and high a p (443)/a<p(675)] were tested using the 
semi-analytical algorithm with the parameters shown in Tables la and lb. 
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Table 3. Algorithm parameters used with the "packaged" 
and modified global data sets. All algorithm parameters 
not listed here are the same as in Table 1 . 


parameter 

packaged 

global 

ao(412) 

1.90 

1.95 

ao(443) 

2.70 

2.95 

ao(488) 

1.90 

1.99 

a 2 (X) 

-0.45 

-0.5 

a 3 (k) 

0.021 

0.025 

Po 

74.1 

72.4 

Pi 

1.0 

1.0 

Co 

0.4818 

0.3147 

Cl 

-2.783 

-2.859 

C 2 

1.863 

2.007 

C 3 

-2.387 

-1.730 


The first numerical filter compares the data sets to the historic [Gordon et al., 1983] CZCS 
chlorophyll pigment algorithm [Chi a = 1.14 fas]' 1 ' 71 , r 25 = R„(443)/R„(550)] to check for consistency 
with this classical algorithm for case 1 waters with relatively little packaging. Figures 7d, 8d, and 9d 
show scatterplots of observed [chi a ] versus r 25 for different sets of warm-water data, with the CZCS 
algorithm illustrated by the dashed line. The warm-water, subtropical and tropical data sets, with little in 
the way of pigment packaging and probably high photoprotective to chlorophyll ratios (Figure 7d), were 
found to be centered over the CZCS algorithm for pigment values less than about 1 mg m' 3 . 

When the CZCS algorithm was applied to data from high-latitude or eastern boundary upwelling 
locations where pigments are packaged into larger cells with more self-shading (Figure 8d), however, the 
CZCS-like chlorophyll a values were typically 50% to 90% lower than those measured. This effect of 
differing bio-optional domains on the performance of the CZCS-like algorithm suggests that regional 
algorithms are needed to obtain best results for different regions or seasons as suggested by Mitchell and 
Holm-Hansen [1991], 

This "filter" approach helped us separate various data sets into two domains, which we call the 

unpackaged-pigment domain and packaged-pigment domain. This type of domain-selection filter, 
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however, cannot be applied to satellite-derived data because of the need for measurements of [chi a\. 

Thus a second type of numerical filter was sought that was reliant on only space-derived data. 

The second numerical filter uses the ratios r 12 = R IS (412)/R rs (443) and r 25 (Figures 7b, 8b, and 9b) 
and is applicable to satellite-derived data. For the Carder data set the line r 12 = 0.95 [r 25 ] 016 was used to 
separate high-gelbstoff data points (those below the line in Figure 7b) from the case 1 data. On the basis 
of the Carder a^ data, the gelbstoff-rich case 2 data had ag(400) values typically in excess of the 
relationship 0.12 [chi a] 0 7 or ag(443) > a^(443). This line also separates case 1 data representative of more 
packaged domains from those representative of less packaged domains since the ratio ri 2 is less affected 
by packaging effects than is r 25 . Thus, for waters far from terrigenous influences, the second filter 
provides a flag for packaging effects. 

Case 1 waters with more packaged pigments from a traditional upwelling region (e.g., CalCOFI) 
were also examined using the second numerical filter. These data fell mostly below the filter line (Figure 
8b) in comparison to the unpackaged data, which fell mostly above the line (Figure 7b). Since pigment 
packaging reduces the absorption for a given concentration of pigments far more at 443 than at 551 nm, 
and somewhat more at 443 than at 412 nm [Morel andBricaud, 1981], packaging and reduction of 
photoprotective pigments significantly increases r 25 while increasing the r 12 ratio somewhat. This places 
packaged data points below the ri 2 = 0.95 [r 2 s]° 16 line (Figure 8b) even without excessive gelbstoff 
concentrations, at least for points with r 2 5 values in excess of a value of about 3.0. Filter 2, then, provides 
a space-based method for separating data points with packaged pigments from those with unpackaged 
pigments, at least for the oligotrophic end of the data sets. 

Filter 2, however, can be distorted by a poorly calibrated sensor or by inaccurate atmospheric 
correction, so it is inadvisable to use it exclusively as a packaging filter without some means of providing 
an independent check on its performance. A means of accomplishing this task is found in Section 5. 

4.4 Algorithm evaluation with the "unpackaged" data set 

Those data sets generally found to be consistent with the CZCS algorithm line and which were 
located primarily above the line r ]2 = 0.95 [r 25 ]° 16 for points where r 2 5 > 3.0, were numerically classified 
as unpackaged, in reference to the pigment-absorption effects on the optics prevalent at those locations at 
the time of data collection. Those data sets with high-r 25 points largely below the line were classified as 
packaged, and a test of a packaged algorithm is developed and discussed below. Those data sets with 
high-r 25 points fairly equally divided by the line were withheld from the tests of both the unpackaged and 
packaged algorithm, but they were included as part of the test of a global algorithm developed and 
discussed below. 
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There are 287 data points in the unpackaged ensemble data set: 134 USF data points and 37 
EqPac, equatorial Pacific points, all measured above water and processed using the Lee et ai, [1996] 
protocols, and an additional 126 EqPac points, all measured below the surface using the Mueller and 
Austin [1995] protocols. Of these points, 261 (91%) were processed by the semi-analytical portion of the 
| [chi a] algorithm, yielding RMS1 and RMS2 errors of 0.099 and 0.230, respectively (Table 4). The 

scatter (Figure 7a) and quantile (Figure 7c) plots overlay the one-to-one line at the ends as well as in the 
middle. For the log-transformed variables, the type II RMA slope was 0.999, the bias was 0.002, and the 
correlation coefficient r 2 was 0.873. When all 287 data points were considered using the semi-analytical 
algorithm plus the blended and empirical algorithms, RMS1 and RMS2 errors were 0.108 and 0.242, 
respectively. The type II RMA slope was 0.973, the bias was -0.003, and r 2 was 0.955. Note that for this 
unpackaged data set, the transition from the semi-analytic to the default algorithm is reasonably smooth 
(Figure 7c). 

Table 4. Summary of regression statistics for each data set tested. The unpackaged data consists of the 
Carder, EqPac above-surface, EqPac below-surface, Taiwan, and MOCE3 data sets. The packaged data 
consists of the CalCOFI and CAL9704 data sets. The global data consists of the global evaluation data set, 
minus the Cota and U. Maryland data plus the high-chlorophyll Carder, EqPac above-surface, Taiwan, and 
MOCE3 data, and uses one set of average algorithm parameters for the whole data set. S A indicates that only 
the modeled values that passed the semi-analytical portion of the algorithm are used (including blended 
values). SA+EMP indicates that all modeled values— semi-analytical, blended, and empirical— are used. All 
statistics except RMS2 are calculated from logio-transformed variables. 


data set 

variable 

n 

intercept 

slope 

bias 

R 2 

RMS1 

RMS2 I 

Carder 

chi SA 

86 

0.019 

1.020 

0.010 

0.921 

0.122 

0.289 1 

| Carder 

chi SA+EMP 

104 

-0.007 

0.977 

-0.002 

0.963 

0.132 

0.300 

1 Carder 

a*(675) SA 

82 

0.098 

1.052 

-0.008 

0.898 

0.131 

0.302 

1 Carder 

ag(400) SA 

26 

-0.278 

0.905 

-0.186 

0.751 

0.252 

0.405 

| unpackaged 

chi SA 

261 

0.001 

0.999 

0.002 

0.873 

0.099 

0.230 

unpackaged 

chi SA+EMP 

278 

-0.019 

0.973 

-0.003 

0.955 

0.108 

0.242 

packaged 

chi SA 

303 

-0.006 

0.999 

-0.006 

0.917 

0.111 

0.268 I 

packaged 

chi SA+EMP 

326 

0.004 

1.012 

-0.003 

0.951 

0.114 

0.282 I 

global 

chi SA 

883 

0.002 

1.003 

0.002 

0.852 

0.176 

0.446 

global 

chi SA+EMP 

976 

0.003 

1.003 

0.002 

0.913 

0.174 

0.440 
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Figure 7. Algorithm performance for and analysis of data sets passing the “unpackaged” numerical filter, (a) Scatterplot of observed 
versus modeled chi a (mg m' 3 ). The dashed line is the one-to-one line, (b) The r I2 versus r 2 5 , with the line, r 12 =0.95[r 25 ] 016 , used to 
identify “unpackaged” case 1 data (above line), (c) Quantile-quantile plot of observed versus modeled chi a. (d) Observed chi a 
versus r 25 , with the Coastal Zone Color Scanner (CZCS) algorithm line C=1.14[r 25 ]' 1,71 . 
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Table 4 provides a complete summary of these statistics. Note that since these algorithms are 
largely semi-analytical in nature and were developed based mostly upon Gulf of Mexico data for the 
parameterization, one would not expect to have slope values of 1 .000 and bias values of 0.000. Note also 
that the r 2 values increased using the blended algorithm because of the extended range of chlorophyll a. It 
is important to note that RMS2 errors of less than 25% significantly exceed our accuracy goal of 35% or 
less. 

4.5 Algorithm evaluation with the "packaged” data set 

Several data sets within the global evaluation set were numerically diagnosed as coining from 
waters where the pigments were much more packaged than those from the warm, tropical and subtropical 
data sets evaluated earlier. The new packaged parameters, shown in Table 3, are used to define a slightly 
different, packaged algorithm for upwelling and winter-spring temperate regions. 

There are 326 points in an ensemble of multiyear, multiseason data sets from the California 
Current which we label as packaged. These consist of historical CalCOFI (n=303) and recent Cal9704 
(n-23) data which we recently collected with G. Mitchell. The CalCOFI data were subsurface 
measurements, while the Cal9704 data were above-surface collections. Three hundred and three points 
(93%) from this packaged data set passed the semi-analytical portion of the new algorithm, yielding 
RMS1 and RMS2 errors for [chi a ] retrieval of 0.1 1 1 and 0.268, respectively. The type II RMA slope 
was 0.999, the bias was -0.006, and the r 2 value was 0.917. The scatterplot (Figure 8a) overlays the one- 
to-one line, and the quantile plot (Figure 8c) is linear and overlies the one-to-one line but has a slight 
discontinuity near a chlorophyll value of 3. This indicates that some parameter modifications for the 
packaged algorithm are needed in this transition region. 

Using the blended algorithm on 326 data points, the r 2 increased to 0.951 while the other statistics 
remained about the same (Table 4). The RMS2 error of about 28% for the packaged algorithm also is 
better than our accuracy goal of 35% or less. 

4.6 Algorithm evaluation with a global data set 

To generate an algorithm to smoothly transition between regions and periods where there axe 
packaged and unpackaged pigments, we developed a global data set combining the packaged, 
unpackaged, and other mixed data sets from the SeaBASS archive. This data set has 976 data points. We 
then developed a set of compromise parameters for this global-average algorithm, shown in Table 3, for 
use at times and places where packaging is unknown or transitional. For this data set and these average 
parameters, 883 (90.5%) of the points passed the semi-analytical portion of the algorithm, yielding RMS1 
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Figure 8. Algorithm performance for and analysis of data sets passing the “packaged” numerical filter, (a) Scatterplot of observed 
versus modeled chi a (mg m* 3 ). The dashed line is the one-to-one line, (b) TTie r 12 versus r 25 , with the line, ri 2 =0.95[r 25 ] 016 , used to 
identify “packaged” data (below line), (c) Quantile-quantile plot of observed versus modeled chi a. (d) Observed chi a versus r 25 , with 
the Coastal Zone Color Scanner (CZCS) algorithm line C=1.14[r 2 s]' L71 . 
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and RMS2 errors in algorithm-derived [chi a ] of 0.176 and 0.446, respectively. The type II RMA slope 
was 1 .003, the bias was 0.002, and r 2 was 0.852. Statistics for the entire n=976 set were similar, except r 2 
was higher (0.913) (Table 4). The scatterplot (Figure 9a) looks evenly clustered about the one-to-one 
line, and the quantile plot (Figure 9c), though sinuous, overlays the one-to-one line for the most part. 
Again, the results from the semi-analytical algorithm fall below those for the default algorithm near the 
transition region. This can be alleviated by reducing the "blending" domain from 0.015 < a^(675) < 0.03 
to 0.008 < a*(675) < 0.015. 

If we are unable to accurately specify the packaging domains of the world ocean, such a 
compromise global algorithm with about 44% accuracy is likely to be the best accuracy that can be 
achieved. This does not meet our accuracy goal of 35% or better, so a focused effort is being made to 
develop an accurate sorting mechanism based on space-derived data to define the bio-optical domains of 
the ocean on a spatial and temporal basis. 

5.0 Strategy for Implementation of Variable Package Parameters (ATBD v.5: April 1999) 

5.1 Nitrate depletion temperatures 

The biggest limitation to algorithm development for the global ocean is a paucity of bio-optical 
field data from around the globe that are complete with ancillary particle and gelbstoff absorption spectra 
and auxiliary data such as sea-surface temperature, salinity, and nutrients. These data are needed in order 
to assess the spatial and temporal variation in the key algorithm parameters: X, Y, S, and, most important, 
ao(7) and a^X). In order to derive [chi a), it is vitally important to be able to predict how the a^(X.)/[chl a ] 
ratio varies. Thus we must study the effect of light history, which is related to season, cloudiness, and 
latitude, as well as nutrient history, which is influenced by mixed-layer depth, upwelling, river plumes, 
and offshore/onshore proximity. 

While algorithms appropriate for regions with strictly packaged or unpackaged pigments can 
reduce the uncertainty in chlorophyll a concentration from perhaps 45-50% to less than 30%, methods 
based upon space-derived data that determine when and where to apply the appropriate parameterization 
are still under development. One method using space-derived data (numerical filter 2) has already been 
discussed, but it is only definitive for waters where r 2 5 > 3.0. Also, stations with high gelbstoff 
concentrations can cause confusion using this method, and inaccurate atmospheric correction can cause 
confusion using this method on any given day. For offshore oligotrophic to mesotrophic waters, however, 
it is a very useful diagnostic tool if used under clear atmospheric conditions. 
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Figure 9. Algorithm performance for and analysis of global data sets without partitioning into “packaged” and “unpackaged” subsets, 
(a) Scatterplot of observed versus modeled chi a (mg m' 3 ). The dashed line is the one-to-one line, (b) The r ]2 versus r 25 , with the line, 
rj 2=0.95 [r 2 5] 016 - (c) Quantile-quantile plot of observed versus modeled chi a. (d) Observed chi a versus r 25 , with the Coastal Zone 
Color Scanner (CZCS) algorithm line C=1.14[r 25 ]* 171 . 


29 






sst=Triangle, NDT=Diamond, chi=Star 





Figure 10. Four-year (1982-1985), monthly mean values of sea-surface temperature 
(triangles), CZCS pigment (astericks), and nitrate-depletion temperature (diamonds) for 
locations near (top) the Gulf of Maine, (middle) Bermuda, and (bottom) Barbados. 

A second space-based approach uses the fact that unpackaged pigments are usually found in high- 
light, nutrient-poor waters where small-diameter phytoplankton cells predominate [e.g., Herblandet al, 
1985; Carder et al 1986]. Since dissolved nutrients cannot be detected from space, a nutrient surrogate 
was sought. Kamykowski [1987] developed a model that explained much of the covariance observed 
between upper layer temperatures and nitrate concentrations [e.g., Kamykowski andZentara, 1986]. D. 
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Kamykowski (personal communication, 1998) has since developed nitrate-depletion temperatures (NDTs) 
for the North Atlantic Ocean. The nitrate-depletion temperature is defined as that temperature at which 
nitrate could no longer be detected, at least using techniques of the era from about 1960-1985. These 
NDTs provide a means to observe from space a variable that indicates when and where nitrate may be 
limiting phytoplankton growth and where upper layer production is dependent upon recycled nitrogen. 
Such phytoplankton are typically small [Her bland et al., 1985] with unpackaged pigments [Carder et al., 
1986], 

To delimit regions of the North Atlantic Ocean that likely contain unpackaged pigments, we have 
compared sea-surface temperatures to Kamykowski’s NDTs. Figure 10 shows annual trends in sea- 
surface temperature, CZCS pigment, and NDTs for the Gulf of Maine, Bermuda, and Barbados. The 
temperatures and pigments are 4-year (1982-85) monthly averages from the Advanced Very High 
Resolution Radiometer (AVHRR) and CZCS sensors archived by the Jet Propulsion Laboratory, Physical 
Oceanography, satellite data archive (USA_NASA_JPL_PODAAC_A005). Note that based on this 
approach, waters in the Gulf of Maine are rarely designated as being nitrogen limited, and those near 
Barbados are always designated as being nitrogen limited, while those near Bermuda are designated as 
being limited in the summer and unlimited in the winter-spring. Clearly, the Gulf of Maine is a lower- 
light, higher-nutrient environment than are Bermuda and Barbados, so the degree of packaging there is 
likely to be much higher. 

By analyzing bio-optical data in the SeaBASS archive, some preliminary functional relationships 
between the NDTs and pigment-packaging classifications for the north Atlantic Ocean were empirically 
derived using sea-surface temperature (SST) derived from the AVHRR satellite sensor: (1) unpackaged 
domain: SST > NDT + 3.0° C, (2) transitional or global domain: NDT +1.8° C < SST < NDT + 3.0° C, 
and (3) packaged domain: SST < NDT + 1 .8° C. 

These domains for the months of February and August are shown for an El Nino year (1983) and 
a normal year (1985) in Plate 1, based upon climatological sea-surface temperatures. Here the tropics and 
most of the subtropics apparently contain phytoplankton with unpackaged pigments, except in the 
northwest African, Peruvian, and equatorial upwelling regions. Here transitional and packaged pigments 
can be observed during part of the year. High-latitude regions are mapped with this method as packaged, 
gradually transitioning toward the equator into the unpackaged domain again. Note the marked difference 
in the domains between El Nifio and “normal” years, especially in the tropics and subtropics. 

Using AVHRR SST data from the physical oceanographic data archive, bio-optical data from 
SeaBASS for a cruise crossing several domains were sorted by domain using NDTs. Data for the 
transition period from spring to summer from the NASA SeaBASS archive were sorted into the three bio- 
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Plate 1 . Bio-optical domains for the global ocean based on 1983 and 1 985 monthly mean temperature compared to nitrate-depletion 
temperatures (NDTs) for (a) February and (b) August. Here black, blue, green, and yellow regions depict land, packaged, transitional, 
and unpackaged domains, respectively. 
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Plate 1 . (Continued) 
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Figure 11 . Comparisons of algorithm-derived and measured chlorophyll 
a values for (top) domain-sorted data for the North Atlantic Ocean during 
the spring-summer transitional period and (bottom) unsorted data. 

optical domains, and the appropriate algorithm parameterization was applied to derive chlorophyll a 
values. May Atlantic Meridional Transect (AMT 4) data along 20° W longitude, North Sea data and 
MLML2 data collected in July, and GOMEX1 and GOMEX2 data collected in April and June provide a 
diverse set of north Atlantic observations that were sorted by the NDT filter and processed. The results 
(Figure 1 1) are compared with those obtained by simple use of the global (transitional) algorithm. The 
RMS1 and RMS2 errors for this diverse data set were 0.153 and 38%, respectively, for domain-sorted 
data, while the errors grew to 0.186 and 50%, respectively, when all were processed using global or 
transitional parameters for the algorithm without sorting by domain. 

Since the spring-summer transition and eastern basin, upwelling data are perhaps the least 

predictable in terms of pigment packaging, the error reduction observed of 12% is indicative of the 
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improvements that can be made by sorting ocean color data into bio-optical domains before applying 
algorithms. A community-wide effort to evaluate and modify this approach for other regions is an on- 
going task. 

5.2 Test of MODIS Algorithms with SeaWiFS Data 

A major end-to-end test of all MODIS Ocean Team products was undertaken under the direction 
of Dr. Robert Evans at the University of Miami using prototype MODIS processing software. SeaWiFS 
data were modified to match Level 1 data from the MODIS data stream and processed. This processing 
included atmospheric correction and production of Level 2, 3, and 4 products [Hawes et al., 2000]. All 
data products discussed under ATBD19 were successfully processed for a 2-day simulation of global 
data. These included a^(675), ag(400), [chi a], a(412), a(443), a(490), a(532), and a(551). These products 
were successfully passed on for processing of absorbed radiation by phytoplankton (MOD22) and 
fluorescence efficiency (MOD20). 

6.0 Expansion of bio-optical domains into high-latitude regions (ATBD v.6: May 2002) 

The dominant effects of pigment packaging when transitioning from tropical to temperate waters 
are included in the original version of the semi-analytic chlorophyll algorithm described previously in 
Section 3 and tested in Sections 4 and 5 [Carder et al., 1999]. The “packaged" parameter set, however, 
which was derived from Southern California temperate data significantly underestimates chlorophyll 
concentrations when applied to high-latitude data. 

Studies in the Southern Ocean indicate that chlorophyll-specific phytoplankton absorption 
coefficients are lower compared to Southern California waters due to increased pigment packaging and 
lower detrital absorption [ Sosik et al., 1992; Reynolds et al., 2001], Since phytoplankton absorption 
typically has the largest impact on remote-sensing reflectance ratios, such differences in a^>*(X) cause the 
relationship between oceanic reflectance ratios and chlorophyll a concentration in both Antarctic and 
Arctic waters [Mitchell and Holm-Hansen, 1991; Mitchell, 1992] to deviate from empirically derived 
relationships generated from mostly northern mid- and low-latitude data [e.g. Gordon et al., 1983; 

O Reilly et al, 1998], Indeed, the application of these latter two-band empirical algorithms to high- 
latitude CZCS and SeaWiFS imagery has consistently led to significant underestimations of chlorophyll 
concentration in the Southern Ocean [Sullivan etal., 1993; Moore et al., 1999] and the Bering Sea 
[Maynard and Clark, 1 987; Muller-Karger et al, 1990], Furthermore, estimates of primary production 
derived from models based on satellite-derived chlorophyll fields may be significantly underestimated in 
polar regions as Antarctic and Arctic waters contribute 20% to global annual primary production 
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[Behrenfeld and Falkowski, 1997]. 

Thus, the domain of “packaged” parameters, previously discussed in Section 4.5, is expanded in 
this section in order to reflect the inherent optical variability displayed in high-latitude polar waters. In 
addition, a smoother blending mechanism to transition between packaging domains is introduced that 
continues to be based on nitrate-depletion temperatures. 

6.1 Development of the “fully packaged 11 parameter set 

While the “unpackaged” parameter set remains the same as in Tables la and lb, the “packaged” 
parameter set has been revised. This domain is now based on Antarctic, Arctic and temperate upwelling- 
center data and will be referred to as the “fully packaged” parameter set. 

The shape of the a^) spectrum was modified for the fully packaged parameter set using data 
collected during spring from the Bering Sea (MF0796) and from the Antarctic Polar Frontal Zone 
(REV9802). Measurement locations, dates, and sources are listed in Table 5. The parameters ao(X) to a 3 (X) 
from Equation 8 were optimized to provide a minimum sum-of-squares error for modeled versus 
measured values of a 9 (>i)/a < p(675). Figure 12 shows the data and the modeled fully packaged curve for 
a<p(443) along with the unpackaged curve for comparison. All fully packaged algorithm parameters are 
listed in Table 6. 



Figure 12. The ratio 8^(443 )/a^(675) versus ^(675) for high latitude data 
from the Bering Sea (MF0796) and the Antarctic Polar Frontal Zone 
(REV9802). The lines are described by equation 8 using the parameters 
listed in Table la and Table 6. 


36 




Table 5. A list of cruises with optical and bio-optical data collected from southern and northern high latitude 
regions. When multiple R rs and [chi a] values were available for the same date and time, the average value 
was used, ‘n’ is the number of stations with both R re and [chi a] data available. ACPZ is the Antarctic Polar 
Frontal Zone. 


Cruise 

P. Investigator 

Institution 

Dates 

Location 

n 

ROAVERRS97 

Arrigo, K 

NASA/GSFC 

12/97-1/98 

Ross Sea 

32 

NBP9711 

Mitchell, G. 

UCSD/SIO 

11/97-12/97 

Ross Sea 

20 

REV9801 

Mitchell, G. 

UCSD/SIO 

1/98 

APFZ 

9 

REV9802 

Mitchell, G. 

UCSD/SIO 

2/98 

APFZ 

2 

AMLR2000 

Mitchell, G. 

UCSD/SIO 

2/00-3/00 

N. of Ant. Peninsula 

21 

AMLR2001 

Mitchell, G. 

UCSD/SIO 

2/01-3/01 

N. of Ant. Peninsula 

29 

LTER 

Smith, R. 

UCSB 

1991-1999 

W. of Ant. Peninsula 

758 

MF0796 

Carder, K. 

USF 

4/96 

Bering Sea 

29 

Ber96 

Cota, G. 

ODU 

7/96 

Bering Sea 

6 

Lab96 

Cota, G. 

ODU 

10/96-11/96 

Labrador Sea 

2 

Lab97 

Cota, G. 

ODU 

5/97-6/97 

Labador Sea 

26 

LabOO 

Cota, G. 

ODU 

5/00-6/00 

Labrador Sea 

8 

Goa97 

Cota, G. 

ODU 

10/97 

Gulf of Alaska 

■ 


The relationship between a<p(675) and [chi a] was modified using Cal9704 upwelling-center data 
to reflect the lowest chlorophyll-specific absorption coefficients expected for phytoplankton cells from 
high-nutrient regions that are exposed to low-light levels. A po coefficient of 79.4 in Equation 1 1 for fully 
packaged waters is equivalent to using an a/(675) value of 0.0126 m 2 (mg chi)' 1 , which is close to the 
lower limits determined in previous studies [Bricaud et ah, 1995; Cleveland, 1995]. 

The default empirical chlorophyll algorithm parameters, Co to C 3 , were modified using Cal9704 
and MF0796 data. Regression of logio([chl a]) against logio(Rrs(488)/R re (551)) for measured [chi a] and 
R re (k) in this data set resulted in values of Co=0.51, Ci=-2.34, C2—0.40, and c 3 =0.00 (n=35) using Equation 
12, yielding RMS1 and RMS2 errors of 0.102 and 25%, respectively. Figure 13 shows the data and the 
modeled fully packaged curve along with other high-latitude empirical expressions from the Southern 
Ocean [Arrigo et al, 1 998; Dierssen and Smith, 2000] and the Labrador Sea [Cota et al., 2000]. 
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Table 6. Semi-analytical algorithm parameters 
for unpackaged and fully packaged data sets. 


Parameter 

Unpackaged 

Fully packaged 

ao(412) 

2.20 

1.02 

ao(443) 

3.59 

1.89 

a<,(488) 

2.27 

1.24 

ao(510) 

1.40 

0.84 

ao(551) 

0.42 

0.32 

a, (412) 

0.75 

0.26 

a,(443) 

0.80 

0.45 

ai(488) 

0.59 

0.42 

a,(510) 

0.35 

0.36 

a,(551) 

-0.22 

-0.08 

a 2 (X) 

-0.50 

-0.45 

a 3 (X) 

0.0112 

0.0210 

po 

51.9 

79.4 

Pi 

1.0 

1.0 

Co 

0.2818 

0.51 

Ci 

- 2.783 

-2.34 

C 2 

1.8630 

-0.40 

C3 

-2.3870 

0.00 

s 

0.0225 

0.0170* 

Xo 

-0.00182 

-0.00182 

X, 

2.058 

2.058 

Y 0 

-1.13 

-1.13 

Y, 

2.57 

2.57 


* A CDOM slope coefficient, S, of 0.01 70 is used for Antarctic fully packaged 
data only. The slope for Arctic fully-packaged data shall remain 0.0225. 
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Figure 13. The [chi a] versus R fS (488)/R fS (551) default empirical algorithm for the “fully packaged” 
pigment domain. The Carder “fully packaged” line (solid thick) represents a quadratic regression on 
log-tranformed Bering Sea (MF0796) and Southern California (Cal9704) data. Also shown are high- 
latitude algorithms from the Ross Sea [Arrigo et al, 1998 ] for regions dominated by diatoms (dia) and 
Phaeocystis (pha) and by cryptophytes (cry), from the region west of the Antarctic peninsula [Dierssen 
and Smith, 2000], and from the Labrador Sea [Cota et al., 2000]. Lines are plotted approximately for the 
original range of values only. 


The new “fully packaged” algorithm is very similar to the Cota et al [2000] relationship for 
chlorophyll concentrations less than 5.0 mg m" 3 . At chlorophyll concentrations greater than 5.0 mg m' 3 , 
the relationship more closely resembles the Dierssen and Smith, [2000] line. The divergence of the 
Arrigo et al, [1998] diatom and Phaeocystis line away from the Cryptophyte line suggests that at high 
chlorophyll concentrations, no single empirical expression will perform accurately in the Southern Ocean. 
This is due to regional [Brody et al, 1992; DiTullio and Smith, 1996; Goffart et al, 2000] and seasonal 
[Moline et al, 1999] shifts in phytoplankton dominance together with the tendency of blooms dominated 
by a single class of phytoplankton to exhibit a large variability in a/(X). 

The slope coefficient, S, for the combined CDOM and detrital absorption term (Equation 9) was 
decreased for the fully packaged parameter set from 0.0225 to a value of 0.017. Applying this lower 
slope value with the Antarctic data optimized algorithm performance and is thought to account for the low 
chlorophyll-specific detrital [Sosiket al, 1992; Clementson et al, 2001] and CDOM [Reynolds et al, 
2001] coefficients observed in Antarctic waters and perhaps low photobleaching. Since Arctic waters 
exhibit higher diffuse attenuation coefficients for CDOM compared to Antarctic waters perhaps due to 

higher riverine input [Mitchell, 1992], we propose to continue using a slope coefficient of 0.0225 for 
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Arctic fully-packaged data and only use a value of 0.017 for Antarctic fully packaged data. 

The wavelength independent parameters (i.e. Xo, X ls Y 0 , and Yi) used in Equation 5 and 6 for 
determining the magnitude and shape of particulate backscattering in Equation 4 remain unchanged for 
the fully packaged parameter set. Although modeled [Dierssen and Smith , 2000] and measured [Reynolds 
et ah, 2001] results indicate that chlorophyll-specific particulate backscattering is lower in the Southern 
Ocean compared to northern mid- and low-latitude waters, Equation 5 corrects for this effect. Essentially, 
the lower green reflectance values (11^(551)) observed in high-latitude southern waters for a given 
chlorophyll concentration [Reynolds et al, 2001] shall yield lower bb P (551) values using this equation. 

6.2 Revised strategy for implementation of variable package parameters 

With the addition of the fully packaged parameter set, a new means for implementing the various 
packaging parameters within the semi-analytic algorithm is required. As the previous stair-stepping type 
of approach for transitioning between packaging domains based on nitrate-depletion temperature 
(discussed in Section 5.0) resulted in unrealistically, large chlorophyll variations in regions with sharp 
temperature changes (e.g. Florida Gulf Stream and upwelling regions) (data not shown), a smoother 
transition among the domains is needed. 

A blending mechanism identical to the one used to provide a smooth transition between the semi- 
analytic and empirical algorithm products (i.e [chi a], ^(675), and ag(400)) (discussed in Section 3. 1.2.7) 
has been established by studying an intense upwelling environment off southern California. Most of the 
variability in pigment packaging was observed to be limited to a 5°C range about the nitrate-depletion 
temperature (NDT). The most packaged pigments there resembled a ^(X) spectra observed during April in 
the Bering Sea data (Figure 12). Thus, phytoplankton absorption parameters from low-light and nutrient- 
replete polar waters (fully packaged) and parameters from high-light and nutrient-poor tropical/ 
subtropical and summer temperate waters (unpackaged) are designated as end members. A linear 
blending algorithm provides for the transition between the two end members. The algorithm is run first 
using both the unpackaged and the fully packaged parameter sets where NDT - 1.0°C < SST < NDT + 

4°C. Chlorophyll concentration is then calculated as 

[chla] = w[chla ] up + (1 - w)[chla ] Fp (19) 


where [chi a] UP is the unpackaged value, [chi a ]? P is the fully packaged value, and the weighting factor is 
w=[1.0 + (SST-NDT)]/5.0. For regions outside this transitional domain, the fully packaged parameter set 
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is used where SST < NDT-1.0°C and the unpackaged parameter set is used where SST > NDT+4.0°C. 
Thus, for any satellite location, the SST is compared to the NDT, and the level of pigment packaging is 
defined as either fully packaged, transitional, or unpackaged (Figure 14). The same approach is used to 
blend the unpackaged and fully packaged pigment domains for algorithm retrievals of a<p(675) and 
3,(400). 



SST-NDT (°C) 

Figure 14. Blending mechanism for transitioning between unpackaged and fully packaged 
pigment domain based on the difference between the sea surface temperature (SST) and the 
nitrate depletion temperature (NDT). The weighting factor used in Equation 19 for SST- 
NDT < -1 .0°C is zero, for-1 ,0°C < SST-NDT < 4.0°C is between zero and one, and for SST- 
NDT > 4.0°C is one. 


Algorithm performance using the revised pigment packaging parameters together with the 
smoother blending strategy was first evaluated using the Cal9704 data set. Cold (~12°C), nutrient-rich, 
and chlorophyll-rich waters upwelled off the coast of California in April 1997 (Figure 15), displayed low 
a u (443)/'a 0 (675) ratios consistent with low-light, shade-adapted phytoplankton populations observed 
during April in the Bering Sea. The data were sorted into the three bio-optical domains (fully packaged, 
transitional, or unpackaged) based on SST’s and NDT’s using the new blending strategy, and the 
appropriate algorithm parameters were applied. The algorithm results are shown in Figure 16. The 
RMS1 and RMS2 errors were 0.116 and 31%, respectively, for domain-sorted chlorophyll data. The 
errors, however, increased to 0.148 and 39% for RMS1 and RMS2, respectively, when average packaging 
parameters were used (e.g. Eq. 19 with a weighting factor of 0.5). Sorting ocean color data into bio- 
optical domains, therefore, reduced the error by 8% and allowed the algorithm to meet the 35% accuracy 
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Figure 15. Surface map of (a) Temperature (°C), (b) Nitrate concentration (pM l' 1 ), (c) [Chi a] (mg m' 3 ), and (d) 
a <p (443)/a t p(675) for the California upwelling region (Cal9704) in April 1997. 


goal originally proposed for this algorithm. 


6.3 Algorithm evaluation of a high-latitude data set 
6.3.1 Data set 

A large multi -region, multi-season, and multi-investigator high-latitude data set (n=871) has been 
compiled that includes Antarctic data extracted from the NASA SeaBASS data archive, spring Bering Sea 
data (MF0796), and Arctic data kindly provided by G.Cota (ODU) (Table 5). This data set which includes 
Rrs(X) and fluorometrically determined chlorophyll concentrations is used to test the revised packaging 
parameters along with the new transitioning strategy based on nitrate-depletion temperatures. Although a 
small subset of this data was used to derive the fully packaged parameter set being tested here, the only 
RnA) data used was to determine the default empirical chlorophyll algorithm (Figure 13). Since 
thedefault empirical algorithm is only used for waters with chlorophyll values greater than —1.0 mg m' 3 , 
less than 1% of the high-latitude data tested here were used during algorithm development. 
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0.0001 0.001 0.01 0.1 1 



obs. a a (400) 

Figure 16. Algorithm performance for domain-sorted California up welling data (Cal9704) 
(a) observed versus modeled [chi a], (b) observed versus modeled a<p(675), and (c) observed 
versus modeled ag(400). The lines are one-to-one lines, SA (+) indicates points that are 
calculated semi-analytically or by a blend of semi-analytical and empirical values, while 
EMP (0) indicates points that are calculated empirically. 
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Radiometric data adjustments were unnecessary for the three lowest MODIS wavelengths 
(Rrs(412), R re (443), and Rrs(488)) since instrument measurements were obtained within 2 nm of these 
wavelengths. The Rrs(565) data, however, collected using the Bio-Optical Profiling System (n=323) 
during the Long Term Ecological Research (LTER) program were converted to R rs (555) using the 
equation developed in O 'Reilly et al., [1998]. 

Data collected at low solar elevations with zenith angles greater than 70° were omitted. Also 
omitted were stations flagged with low R re values (R re (412) < 0.0008 sr' 1 and R re (551) < 0.001 sr" 1 ) or with 
inconsistent empirical and semi-analytic chlorophyll values. When the fully packaged parameter set was 
applied, the semi-analytical and empirical output products ([chi a], a^(675), ag(400)) were blended 
according to Equation 14 when the semi-analytic algorithm returned a,(675) values between 0.0075 and 
0.015 m' 1 . This differs from the range used for unpackaged date (0.015 - 0.030 m 1 ) as a result of the 
lower chlorophyll-specific absorption coefficient observed in high-latitude waters compared to mid- 
andlow-latitude waters. 


The Antarctic data (ROAVERRS97, NBP971 1, REV9801 , REV9802, AMLR2000, AMLR2001, 
and LTER) consist of R re values determined from subsurface upwelling radiance extrapolated to just 
below the surface, L„(0' ,X), and downwelling irradiance extrapolated to just below the sea surface, 
Ed(0\L). Remote-sensing reflectance just above the sea surface, Rfs(0 + ,X.), was calculated using the 
following equation: 


Rrs(0\l) 


0.54L u ((r.t') 
1-04 Ea( O', A) 


( 20 ) 


where 0.54 and 1.04 are transfer coefficients across the air-sea interface for L u and Ed, respectively 
[Austin, 1974]. All of the Antarctic data except for that collected during the AMLR2000 and AMLR2001 
cruises are corrected for ship shadow effects. 

The spring Bering Sea data set (MF0796) includes hyperspectral R^A.) measurements collected 
just above the sea surface and processed according to Lee et al, [1996]. Cota’s Bering Sea (Ber96), 
Labrador Sea (Lab96, Lab97, and LabOO), and the Gulf of Alaska (Goa97) Rrs data were calculated using 
Equation 20 from subsurface Lu(X) and E d (X) data. 

The in situ chlorophyll concentrations for the entire high-latitude data set ranged between 0.06 
and 40 mg m' 3 . Mean and median values for the Antarctic data are 2.9 mg m’ 3 and 1 .4 mg m 3 , 
respectively. Similar values for the Arctic data (2.5 mg m 3 and 1.5 mg m“ 3 , respectively) were observed. 

Data collected in the Southern Ocean represent the majority (92%) of the high-latitude data set. Most of 
these data (87%) were collected west of the Antarctic peninsula (LTER) between 1991 and 1999 and over 
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95% of the data were collected between November and February. Since the majority of the Antarctic data 
were collected either near coastal polynas greatly influenced by sea ice meltwater (i.e. Ross Sea) or 
adjacent to continental margins (i.e. north and west of the Antarctic peninsula), the Antarctic data set over 
emphasizes bloom-dominated regions and under emphasizes oligotrophic regions influenced by the major 
circumpolar currents. 

6.3.2 Antarctic data 

Compared to the two-band empirical CZCS algorithm [Gordon et al., 1983], the Antarctic data 
exhibited higher R rs (443)/R rs (551) ratios on average for a given chlorophyll concentration (Figure 17d). 
This may be attributed to lower a^*(X) due to increased pigment packaging, decreased chlorophyll-specific 
detrital and CDOM absorption, and decreased chlorophyll-specific backscattering compared to northern 
mid- and low-latitude data where the CZCS algorithm was developed [Sosik et al., 1992; Dierssen and 
Smith, 2000; Reynolds et al., 2001], The first two factors would result in elevated R IS (443) values while 
the third factor would result in decreased R rs (55 1) values. 

Lower chlorophyll-specific detrital and CDOM absorption relative to values from northern mid- 
and low-latitude waters is also evident in the high R rs (412)/R rs (443) ratios exhibited at high chlorophyll 
concentrations (or low R re (443)/R rs (551) ratios) using the second numerical filter discussed in Section 

4.3.2 (Figure 17b). The R re (412)/R re (443) ratios are higher than those observed in the global data set 
(Figure 9b) as CDOM and detrital absorption increase at a slower rate with increasing chlorophyll 
concentration causing R re (443) to decrease faster than R IS (412). At low chlorophyll concentrations (or 
high R rs (443)/R rs (551) ratios), the Antarctic data fall mostly below the line ri2=0.95[r2 5 ] 0 16 similar to the 
Southern California upwelling data (Figure 8b) and are consistent with pigments derived from a highly 
packaged pigment domain. 

Since sea-surface temperatures were not available for the Antarctic data set, the semi-analytic 
algorithm was run using the fully packaged parameter set only. The scatterplot of measured versus 
modeled chlorophyll concentrations overlies the one-to-one line (Figure 17a). Similarly, the quantile- 
quantile plot (Figure 17c), though sinuous, overlies the one-to-one line quite well for chlorophyll 
concentrations less than 10 mg m' 3 . At chlorophyll concentrations greater than 10 mg m' 3 , the increased 
scatter and bias may be attributed to the unique optical signatures associated with monospecific blooms 
(i.e. Figure 13) and is most likely the main reason why the default empirical algorithm fails to perform 
accurately here. 

Processing the Antarctic data using the SeaWiFS OC4 (version 4) empirical algorithm [O Reilly 
et al., 2000] results in significant underestimations of chlorophyll concentration between -0.2 and 10 mg 
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Figure 17. Algorithm performance for and analysis of the Antarctic data set. (a) Scatterplot of observed versus modeled chi a 
(mg m' 3 ) calculated semi-analytically (+) and empirically (0). The dashed line is the one-to-one line, (b) The r 12 versus r 2 5 
relationship with the line, ri 2 = 0.95[r 25 ] 016 . (c) Quantile-quantile plot of observed versus modeled chi a. (d) Observed chi a 
versus r 25 , with the Coastal Zone Color Scanner (CZCS) algorithm line C=1 .14 [r 25 ]'' 7I . 


m' 3 . Errors such as this are readily apparent using graphical algorithm evaluation criteria (Figure 1 8). 
Statistically, however, the RMS2 error generated from non-log-transformed data is erroneously low due 
to the large bias (-0.267) that exists between the modeled and measured data (Table 7). Since the RMS2 
error (Equation 17) does not place equal weight on underestimates as on overestimates, and fails to 
provide the same error estimate when regressing X against Y as when regressing Y against X, we now 
linearize the RMS1 error generated from log-transformed data as follows 


RMS Un = 0.5 • [(10™ 5 -1)+(1 - 10'"®)] 


( 21 ) 


The RMS1 , RMS2, and RMSi in chlorophyll retrieval errors for the Antarctic data (n=871) using 
the semi-analytic algorithm were 0.22, 0.72, and 0.53, respectively. The RMS1, RMS2, and RMSi, n 
errors using the SeaWiFS OC4v4 algorithm for the Antarctic data were 0.35, 0.60, and 0.89, respectively. 
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Figure 18. SeaWiFS OC4v4 algorithm performance for the Antarctic 
data set. (a) Scatterplot of observed versus modeled chi a (mg m -3 ). 

The dashed line is the one-to-one line, (b) Quantile-quantile plot of 
observed versus modeled chi a. 

Table 7. Summary of regression statistics for each data set tested. The Antarctic data set consists of the 
following cruises: ROAVERRS97, NBP971 1, REV9801, REV9802, AMLR2000, AMLR2001, and LTER. 
The Arctic data set includes cruises in the Bering Sea (MF0796 and Ber96), the Labrador Sea (Lab96, Lab97, 
+ LabOO), and the Gulf of Alaska (Goa97). All statistics except RMS2 are calculated from logi 0 -transformed 
variables. 


Data Set 

algorithm 

n 

Intercept 

Slope 

Bias 

t 2 

RMS1 

RMS2 

RMSii n 

Antarctic 

Carder 

semi-analytic 

871 

-0.037 

1.008 

-0.036 

0.82 

0.219 

0.718 

0.525 

Antarctic 

SeaWiFS 

OC4v4 

871 

-0.271 

1.022 

-0.267 

0.81 

0.348 

0.604 

0.889 

Arctic 

Carder 

semi-analytic 

75 

-0.000 

1.038 

-0.009 

0.82 

0.172 

0.465 

0.406 

Arctic 

SeaWiFS 

OC4v4 

75 

-0.151 

1.038 

-0.143 

0.83 

0.223 

0.398 

0.537 











































Note, once again, that the RMS2 error is erroneously low when using the OC4v4 algorithm compared to 
when applying the semi-analytic algorithm due to the aforementioned large bias. This bias causes the 
linearized fractional error, RMSun, for OC4v4 to be ~1 70% higher than that for the semi-analytical 
algorithm. A bias of 27% for OC4v4 indicates that SeaWiFS-derived chlorophyll concentrations may 
perhaps be 25-30% low during austral summer which could result in similar underestimations of primary 
production and CO 2 uptake for the Southern Ocean. 

6.3.3 Arctic data 

The Arctic data are similar to the Antarctic data when plotting chlorophyll a concentration versus 
R re (443)/R re (55 1 ) (Figure 1 9d). The majority of the stations had higher reflectance ratios for a given 
chlorophyll concentration compared to the CZCS algorithm [Gordon et al., 1983], Coastal stations in the 




R rs (443)/R rs (551 ) 



Figure 19. Algorithm performance for and analysis of the Arctic data set. (a) Scatterplot of observed versus modeled chi a 
(mg m' 3 ) calculated semi-analytically (+) and empirically (0). The dashed line is the one-to-one line, (b) The r :; versus r 25 
relationship with the line, r 12 = 0.95[r 2 5]° 16 . (c) Quantile-quantile plot of observed versus modeled chi a. (d) Observed chi a 
versus r 25 , with the Coastal Zone Color Scanner (CZCS) algorithm line C=1.14 fas]' 171 . 
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Labrador Sea that were dominated by diatoms exhibited reflectance ratios that deviated the most from the 
CZCS line. Compared to the open-ocean stations in this region, the waters were colder and less saline 
perhaps due to ice-edge melting. Also, chlorophyll-specific phytoplankton absorption coefficients were 
relatively low and consistent with other studies of diatom-dominated waters in this region [Lutz et al., 
1996]. 

The spring Bering Sea data (MF0796) also exhibited relatively high reflectance ratios for data 
collected offshore of the 100m isobath with a g (443)/ap h (443) ratios less than 2. Spring stratification due to 
increased thermal heating and decreased wind mixing events had not yet occurred. Vertical profiles of 
temperature and salinity exhibited a well-mixed water column indicative of nutrient-replete conditions 
and favoring the growth of large, shade-adapted diatoms containing heavily packaged pigments (Figure 
12) [Goering and Iverson, 1981]. Stations located inshore of the 100m isobath with ag(443)/ap h (443) 
ratios greater than 2, however, exhibited reflectance ratios more consistent with the CZCS line. Such a 
tendency for increased pigment packaging to increase blue-to-green reflectance ratios but then be offset 
by increased ag(443)/ap h (443) ratios that decrease reflectance ratios is also evident in model simulations 
[Sathyendranath et al., 2001], 

The summer Bering Sea (Ber96) and the fall Gulf of Alaska (Goa97) R rs (443)/R re (551) ratios 
overlie the CZCS line (Figure 19d). These data were collected in highly stratified waters with surface 
mixed layers ~20m deep. Surface nutrient concentrations were depleted and daily solar fluxes were 
sufficient to favor the growth of picoplankton over microplankton [Odate, 1996], These nutrient-efficient 
and high-light-adapted phytoplankton species with higher chlorophyll-specific absorption coefficients due 
to decreased pigment packaging and increased photoprotective carotenoids are somewhat more similar to 
phytoplankton assemblages observed in coastal waters used in part to generate the CZCS algorithm. 

Chlorophyll-specific CDOM absorption was relatively low for Labrador Sea data and spring 
Bering Sea data collected offshore of the 100m isobath. The chlorophyll versus ag(400) relationship for 
these waters was lower than the relationship discussed in Section 4.3.2 for separating gelbstoff-rich mid- 
and low-latitude waters, but higher than that observed for Southern Ocean waters [Reynolds et al., 2001]. 
As a result, R re (412)/R rs (443) ratios for these waters (Figure 19b) were higher than the nearshore 
(gelbstoff-rich), spring Bering Sea data, but lower than the Antarctic data (Figures 17b). 

As the Arctic data set encompasses a wide range of bio-optical domains ranging from the deeply- 
mixed (light-limited and nutrient replete), spring Labrador Sea to the highly stratified (high-light and 
nutrient depleted), summer Bering Sea, the data were sorted based on nitrate-depletion temperatures to 
determine the appropriate level of pigment packaging prior to applying the semi-analytic algorithm. 
Although the correlation between nitrate concentration and temperature may be weak in high-latitude 
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oceanic waters [Cota et al., 2000] since other factors (i.e. light, Si, Fe, etc.) may be responsible for 
limiting phytoplankton biomass accumulation, we believe that deviations is sea surface temperatures from 
nitrate depletion temperatures are also indicative of the degree of water-column stratification. Increased 
water column stratification may decrease the level of pigment packaging by maintaining the 
phytoplankton in a higher light environment near the surface. For instance, if phytoplankton found in 
high-latitude waters are light-limited rather than nutrient-limited, then the increased vertical stratification 
that occurs during late spring due to increased solar insolation, decreased wind stress, and in some cases 
increased ice-edge melting would reduce this light-limitation [Sverdrup, 1953; Maynard and Clark, 1987; 
Mitchell et al„ 1991], The ratio of photo-protective carotenoids to chlorophyll a would increase resulting 
in higher chlorophyll-specific phytoplankton absorption coefficients [Bricaud et al., 1995] typical of a 
less packaged pigment domain (i.e. summer Bering Sea). 

The scatterplot of measured versus modeled chlorophyll concentration (Figure 19a) overlies the 
one-to-one line, and the quantile plot (Figure 19c) is linear and overlies the one-to-one line. The three 
semi-analytically derived outliers (Figure 19a) are from coastal Labrador Sea stations (Lab97). 
Chlorophyll-specific absorption coefficients for these stations were very low. Sea-surface temperature 
and salinities were also low, perhaps indicative of more optically complex ice-edge meltwaters. Although 
the algorithm fails to perform well here, algorithm performance in the more areal-rich open-ocean waters 
was good. 

The RMS1 and RMS lin errors for the Arctic data (n=75) using the semi-analytical algorithm were 
0.17 and 0.41, respectively (Table 7). Processing the data using the SeaWiFS OC4v4 algorithm resulted 
in larger RMS1 and RMS Sm errors of 0.22 and 0.54, respectively. These larger errors are primarily due to 
the large bias (-0. 143) exhibited when using this empirical algorithm. The higher blue-to-green 
reflectance ratios for these data lead to significant underestimations of chlorophyll concentrations similar 
to those found for the Antarctic data. 

7.0 MODIS Terra data (v4.2.2, reprocessing 004) (ATBD v.7: January 2003) 

Using the MODIS Team-Leader computing facility (MOD APS), validated MODIS Terra data 
(Collection 4, “004 reprocessing”, released June 15, 2002) are compared for Chlor_a_3 and Chlor_a_2 
algorithms. Chlor_a_2 is designed to be a MODIS surrogate for the SeaWiFS OC4 algorithm [O’Reilly et 
al., 2000]. It can’t be exactly the same as OC4 because the 5 10 nm SeaWiFS band is unavailable using 
MODIS data. In this section, the spatial and temporal variability of the empirical (Chlor_a_2) and semi- 
analytical (Chlor_a_3) data products are analyzed. Also, Chlor_a_2 and Chlor_a_3 are compared to in 
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situ chlorophyll a concentrations provided by the SeaWiFS SeaBASS data archive [Werdell and Bailey, 
2002] to evaluate algorithm performance. 

7.1 Comparison of Chlor_a_2 and Chlor_a_3 

SeaWiFS radiance imagery has been used with the Carder et al., [1999] semi-analytic (SA) code 
for an upwelling site [Smyth et al., 2001] and for a river-plume site [Hu et al., 2003]. In the upwelling 
case OC4 retrievals underestimated the high-chlorophyll values while the SA retrievals were much more 
accurate. For the river-plume case OC4 retrievals overestimated chlorophyll for gelbstoff-rich regions by 
as much as a factor of 2. The SA code with global parameters was generally within 35% of field values 
except where the absorption was dominated by gelbstoff, where mean error increased to about 41%. 

Comparing retrievals from the two algorithms for MOD1S Terra global data, composite-averaged 
over the week starting 8 November 2000 (Figure 20), three modal values are apparent in Chlor_a_3 data 
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Figure 20. Histograms of weekly mean global chlorophyll concentration from MODIS for the week 
beginning 8 November 2000: (top) Chlor_a_3 and (bottom) Chlor_a_2 retrievals (courtesy MODIS 
Quality Assurance Browse Imagery, Terra collection 4). Mean values are 0.325 mg m' 3 for Chlor_a_3 
and 0.2 14 mg m* 3 for Chlor_a_2. Note the differences above 0.30 mg m' 3 . 
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at the following values: 0.07, 0.18, and 0.60 to 1.2 mg m' 3 . Chlor_a_2 modal values are found at 0.08 and 
0.15 mg m' 3 , suggesting similar mean values in the oligotrophic central gyres, but reduced Chlor_a_2 
values for more eutrophic waters. With no third mode for Chlor_a_2, its global mean value (0.214 mg m' 
3 ) is roughly 2/3 the global mean value (0.325 mg m' 3 ) provided by Chlor_a_3. This underestimate is 
consistent with low SeaWiFS OC2 retrievals [Moore et al., 1999] and a 27% negative bias for OC4 
retrievals (see Figure 18) for the Southern Ocean field data. This example represents a period of time (e.g. 
fall bloom for the northern hemisphere and spring bloom for the Southern Ocean) when the weakness of 
the SeaWiFS OC2 and OC4 algorithms is most apparent. Note that historical, fall, globally averaged 
chlorophyll a values are 0.305 mg m' 3 [Gregg and Conkright, 2001], very close to the Chlor_a_3 value. 

MODIS Terra chlorophyll images from the GESDAAC (Goddard Earth Science Distributed 
Active Archive Center) for Chlor_a_2 and Chlor_a_3 were composited in 39 km bins for December 2000 
and are shown in Plate 2. The subtropical gyre regions appear similar for the two images, and the northern 
modal retrievals (0.087 mg m" 3 ) and southern modal retrievals (0.078 mg m' 3 ) for the two algorithms are 
essentially the same. However, the global mean value for Chlor_a_3 (0.294 mg m‘ 3 ) is higher than for 
Chlor_a_2 (0.23 1 mg m' 3 ) because in part the high-latitude and equatorial upwelling regions for 
Chlor_a_3 are richer in chlorophyll a. 

Figure 21 shows that monthly mean values for Terra Chlor_a_2 and Chlor_a_3 are similar from 
March to June for austral fall for the southern hemisphere, but Chlor_a_2 is only 60% of Chlor_a_3 
during austral spring, largely due to the package effect. Chlor_a_3 values are larger for northern spring 
data, but smaller during summer and fall. Gelbstoff-rich runoff from northern rivers enrich summer water 
color, apparently increasing perceived chlorophyll retrieved by Chlor_a_2. Chlor_a_3 spectrally 
partitions this type of component from chlorophyll a, effectively minimizing the influence of river runoff 
on pigment overestimates. 

Dividing the globe even further, the following trends were observed. While low-latitude 
equatorial waters exhibit similar average mean chlorophyll-u concentrations for Chlor_a_2 (0.29 mg m' 3 ) 
and Chlor_a_3 (0.27 mg m‘ 3 ) (Table 8), Chlor_a_3 retrievals are higher in the eastern equatorial Pacific 
(Figure 5) due to upwelling. Average mean chlorophyll-u values for mid-latitude waters in the southern 
hemisphere are similar for Chlor_a_2 (0.18 mg m' 3 ) and Chlor_a_3 (0.17 mg m‘ 3 ), but Chlor_a_2 values 
are higher in the northern hemisphere. Again, gelbstoff-rich runoff from northern rivers enriches water 
color, possibly increasing perceived chlorophyll-u retrieved by Chlor_a_2. Monthly mean values for 
southern high-latitude waters are similar for March and June, but Chlor_a_2 is only 50% of Chlor_a_3 
during austral spring (Plate 2, Table 8), again largely due to the package effect. Northern high-latitude 
monthly mean chlorophyll-u retrievals also exhibit larger Chlor_a_3 values in the spring due to pigment 
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Plate 2. Global compisited maps (December 2000) of chlorophyll a concentration (mg m' 3 ) retrieved using empirical 
(Chlor_a_2; top) and semi-analytic (Chlor_a_3; bottom) algorithms from MODIS Terra radiometry. 
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Figure 21. MODIS (reprocessed, Collection-4) Chlor_a_3 and Chlor_a_2 monthly 
means for Northern and Southern hemispheres from Nov-2000 and Feb-2002. 
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Table 8. MODIS Terra (reprocessed. Collection 4) monthly mean chlorophyll-a concentrations (mg m ~ 3 ), 
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15°S-15°N 

15°N-35°N 
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Chlor 

Chlor 
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Chlor 

Chlor 

Chlor 

Chlor 

Chlor 

Chlor 

Date: 

a 2 

a 3 

a 2 

a 3 

a 2 

a 3 

a 2 

a 3 

a 2 

a 3 

Dec-00 

0.35 

0.72 

0.17 

0.17 

0.30 

0.31 

0.36 

0.29 

0.67 

0.78 

Mar-01 

0.31 

0.36 

0.18 

0.14 

0.29 

0.26 

0.39 

0.37 

0.71 

0.97 

Jun-01 

0.27 

0.31 

0.19 

0.15 

0.29 

0.24 

0.27 

0.21 

1.31 

1.27 


0.27 

0.41 

0.20 

0.22 

0.29 

0.28 

0.37 

0.26 

1.08 

0.88 


0.30 

0.45 

0.18 

0.17 

0.29 

0.27 

0.35 

0.28 

0.94 

0.97 


packaging. Differences between Chlor_a_2 and Chlor_a_3 values in this region, however, are smaller 
than for southern high latitudes as gelbstoff-rich runoff offsets the effects of pigment packaging 
[Sathyendranath et al, 2001]. 

7.2 Validation of Chlor_a_2 and Chlor_a_3 

The validity of Chlor_a_2 and Chlor_a_3 data was evaluated using match-ups with ship data from 
the SeaWiFS SeaBASS data archive [Werdell and Bailey, 2002]. Chlorophyll imagery from MODIS 
Terra was compared to ship data following the general match-up exclusion protocol of McClain et al, 
[2000], but only quality-level zero MODIS chlorophyll a data were used. This eliminated data from shoal 
areas (< 30 m) and provided cloud-free data collected within three hours of the satellite overpass. 
Furthermore, for selected pixels within 100 pixels of the scene edge, 5-by-5 pixel means rather than 
traditional 3-by-3 median values were used to improve signal-to-noise ratios. Field chlorophyll values 
within the top 1 0 meters were averaged for comparison with satellite retrievals. 

Global match-up results are presented in Figure 22 for the Antarctic, Equatorial Pacific, 

California Current, western mid-Pacific, and the West Florida Shelf. Note that both Chlor_a_2 (RMS = 
0.174) and Chlor_a_3 (RMS = 0.170) are accurate to less than 1/5 log unit, more accurate than the OC4v4 
algorithm (RMS=0.222) update [O ’Reilly et al, 2000] using an expanded global field of data. Ideally, for 
a 1 : 1 line, A=B=1 .0. Although the RMS error values for the two algorithms are similar, Chlor_a_3 values 
for A and B are much closer to 1.0. This suggests that the error due to overestimates by Chor_a_2 at low 
values and underestimates at high values are comparable to additional scatter about the line associated 
with Chlor_a_3. 

The error is reduced to 0.091 for Chlor_a_3 by removing the seven West Florida Shelf (WFS) 
points. In fact continental shelves with terrigenous runoff are not expected to exhibit typical NDT 
behavior. For the West Florida Shelf, upwelling of 18-20° C from below the Loop Current region is not 
unusual [Austin, 1970; Haddad and Carder, 1979], and convective overturn of surface waters in the 
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Figure 22. Relationships between in situ [Chi a] and MODIS-derived [Chi a] using a) Chlor_a_2, b) Chlor_a_3, c) 
Chlor_a_3 without the West Florida Shelf (WFS) points, and d) Chlor_a_3 with the WFS points that have been NDT- 
adjusted. Dashed lines are one-to-one lines and solid lines are linear regression lines in log-log coordinates. 


winter is expected to refresh nutrients with water having similar thermal properties. We found that using 
an NDT=19.5° C for the seven WFS data points in the match-up data set provided the best algorithm 
accuracy, lowering the error estimate when WFS points are included to RMS=0.088, or less than 1/10 of a 
log unit. This suggests, then, that data sets rich in continental shelf data points should be scrutinized on a 
regional basis. If that is not possible, shelf data can be addressed using a default “global” parameter set 
such as given in Carder et al, [1999], that minimizes extreme excursions due to the package effect. 

The RMSiin values for data in Figure 22 calculated using Equation 21 are 41%, 40%, 21%, and 
20%, respectively. These results are excellent, with nearly all values reaching the 35% goal set for 
MODIS chlorophyll a accuracies. Without stations from continental shelves, the accuracies easily surpass 
goal values for Chlor_a_3. With local NDT adjustments for coastal stations, sub-25% error values also 
appear attainable from Chlor_a_3. 
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8.0 MODIS Terra data (v4.10.11, reprocessing 041) (ATBD v.8: April 2004) 

MOD1S Terra radiance data and data products were reprocessed (version 041, released October 
2003) and nitrogen depletion temperatures were updated prompting a second match-up analysis. 

8.1 Updated nitrate depletion temperatures 

Nitrate depletion temperatures for each 10° latitude and 10° longitude grid in the world ocean 
were calculated previously as the median temperature associated with nitrate concentrations less than 
5pM using data from the National Oceanographic Data Center (NODC) nutrient database [Kamykowski 
et aL, 1986]. Recently, NDT’s were updated by Kamykowski et al (2002) to obtain more representative 
values by calculating real cubic regression roots for each 10x10 degree grid from the same data. Since 
enormous gaps existed using this new approach, these values were regressed against the original NDT 
values [Kamykowski et al , 1986] to provide adequate global coverage. 

These new NDT values were first contoured according to the methods described in Kamykowski 
et al (2002) in Surfer (Golden Software) using a combined variogram/kriging approach (Fig. 23a). 
Variogram modeling is a way to quantitatively assess the spatial continuity of data, taking into 
consideration the length scale, data repeatability, and anisotropy of the data. Using this technique, 
however, real changes in NDT (i.e. upwelling regions) are smoothed out. 

Contouring the data without the variogram modeling produces a map that represents the original 
data better (Fig. 23b). However, the natural anisotropy of the data, whereby NDT’s change with latitude 
more rapidly than they change with longitude, is not taken into consideration, creating contour lines that 
are not smooth enough. The variogram component that weighs the inherent trustworthiness of each data 
point, or the so-called “nugget effect”, causes the kriging interpolation technique to emphasize overall 
data trends at the expense of individual data points. In order to create a balance between preserving the 
data and providing a smooth transition between data points, this nugget effect was reduced. 

Prior to re-contouring the data with a reduced nugget effect, changes in the original data 
[Kamykowski et al, 2002] were made for both the Southern Ocean and the Gulf of Mexico (Fig. 24). 
Since Southern Ocean NDT data were sparse resulting in dips in contour lines where data were not 
present (Fig. 23b), nitrate depletion temperatures at 75 °S of 1° C were artificially added every 10 degrees 
where data were not present and were changed to 1° C where data where present. Also, the sparse nitrate 
depletion temperatures at 65 °S were omitted. 

Since pigment packaging on the west Florida shelf is controlled largely by riverine runoff (e.g. 
CDOM provides photoprotection) as opposed to winter overturn, nitrate depletion temperatures at several 
locations in this region were decreased and one point was omitted prior to contouring in order to provide 
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Figure 23. Global contour plots of nitrate depletion temperatures [Kamykowski et al, 2002] a) with 
the variogram and b) without the variogram. 


lower values that were determined to be more appropriate for this region [Carder et al , 2004] (Fig. 24). 
Previous NDT values of ~23°C for the west Florida shelf resulted in significant overestimations in 
chlorophyll a concentrations when applied to remote-sensing reflectance data collected during the 
Ecology of Harmful Algal Blooms (ECOHAB) program. Decreasing the nitrate depletion temperatures in 
this location to ~19.5°C decreased the level of pigment packaging obtained for winter and spring months 
when sea surface temperatures were generally less than 25 °C, providing a stronger (r 2 increased from 83% 
to 92%), more linear (slope decreased from 1.06 to 1.02) correlation between measured and modeled 
chlorophyll values (n=155). During summer and fall months, unpackaged pigment parameters were 
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Figure 24. Nitrate deplection temperature values (°C) for every 10° latitude by 10° longitude square for which data were 
available [Kamykowski et al, 2002]. Shaded symbols represent changes made to original values: triangles mark locations where 
data were omitted, circles depict values that were changed, and squares show where data points were added. 


obtained using these new NDT values since the NDT-SST threshold of 4°C was exceeded. This resulted 
in underestimations in chlorophyll concentrations. 

Thus, in order to prevent large variations in pigment packaging parameterization that are 
inappropriate for runoff-rich, tropical/subtropical regions, an alternative space-based approach for 
determining the degree of pigment packaging for such regions is required. Decreasing the NDT to 19.5°C 
and capping the weighting coefficient for pigment packaging (Fig. 14) at 80% resulted in an even stronger 
(r 2 =93%) correlation between measured and modeled chlorophyll along with the lowest rms error (0.099 
compared to 0.168 for NDT=23°C and 0.150 for NDT=19.5°C, respectively) (n=155). Perhaps a future 
pigment packaging implementation scheme could cap the packaging weighting factor in these runoff-rich 
regions based on the degree of photoprotection provided by CDOM measured from space as the ratio of 

ag(X)/ap h (X). 

The resulting global NDT contoured data (Fig. 25) were subtracted from quarterly MODIS Terra 
(reprocessing 041) SST data (Plate 3) and these values were binned into four packaging domains 
(unpackaged, transitional 1, transitional 2, and fully-packaged (Plate 4, Fig. 26). In order to provide a 
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Figure 25. Global plot of nitrate depletion temperatures (°C) contoured using combined kriging and variogram (nugget effect set 
to 1 /8 Ih of original value) approach with changes made to original [Kamykowski et al., 2002] data. 

gradual shift between unpackaged and fully-packaged pigment domains, only so-called “endmember” 
waters are classified as unpackaged or fully-packaged while the majority of the ocean is classified as 
transitional. Since this transitional packaging domain spans 5°C (Fig. 26) and comprises ~50-60% of the 
entire globe, this domain is split into two sections (transitional 1 where -1 < SST-NDT < 1.5 and 
transitional 2 where 1 .5 < SST-NDT < 4.0) to better visualize global changes in pigment packaging. 
Global spatial and temporal nitrogen limitation patterns are analyzed using these plots (Plate 4) and 
comparisons are made between the new NDT parameters and implementations scheme and the old NDT 
values [. Kamykowski et al., 1986] and the Carder et al. (1999) implementation scheme (Section 5.1, Plate 
1 ). 

Changing the threshold level for the unpackaged pigment domain from SST-NDT > 3 °C to SST- 
NDT > 4 °C (Fig. 26) results in a much smaller area of the globe classified as unpackaged. While the 
entire tropical and subtropical region was previously classified as unpackaged during both winter and 
summer (Plate 1), a narrow ~30° latitudinal band is classified as unpackaged in the southern hemisphere 
during winter and in the northern hemisphere during summer using the new NDT values and 
implementation scheme. The south Pacific subtropical gyre is unpackaged year round as is the central 
Gulf of Mexico. 
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Plate 3. MODIS Terra monthly sea surface temperatures (°C) (Collection 4, reprocessing 041) for January 2001 (upper 
left), April 2001 (upper right), July 2001 (lower left), and October 2001 (lower right). 



Fully-packaged 


SST-NDT>-1°C 


Transitional 1 Transitional 2 Unpackaged 

- 1 °C>S ST-NDT> 1 .5°C 1.5°C>SST-NDT>4°C SST-NDT>4°C 


Plate 4. Bio-optical domains for the global ocean calculated as the difference between MODIS Terra monthly mean sea surface 
temperatures (°C) (Collection 4, reprocessing 041) and nitrate depletion temperatures (°C) [Kamykowski et al., 2002] modified 
in Figure 24 for January 2001 (upper left), April 2001 (upper right), July 2001 (lower left), and October 2001 (lower right). 
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Figure 26. Comparison between previous blending mechanism (Section 5.1) for 
transitioning between unpackaged and packaged pigment domains and current 
blending mechanism (Section 6.2). 


Regions classified as fully packaged year round include the Southern Ocean, Benguelan Current 
upwelling system off the southwestern coast of Africa, and the north Atlantic (>75°N). The Peruvian, 
northwestern Africa, California, and equatorial upwelling regions are classified as fully-packaged during 
certain times of the year only. 

A basic assumption when determining the degree of pigment packaging associated with a given 
phytoplankton population based on the presence or absence of nitrate using NDT’s is that nitrogen is the 
only factor limiting phytoplankton growth and species succession. Indeed, it has been widely accepted 
that many areas of the ocean are not N-limited, but instead may be iron-limited (i.e. so-called high- 
nutrient, low-chlorophyll (HNLC) areas) (e.g. Martin et al, 1988) or light-limited (e.g. Mitchell et al, 
1991). 

Moore et al. (2002) shows that nitrogen is responsible for limiting phytoplankton growth in half 
of the world ocean during summer months. The other half of the ocean is limited by iron or silica. In the 
future, spaced-based surrogates for iron limitation [Kamykowski et al, 2002; Gregg et al, 2003] may 
possibly complement our existing strategy for inferring phytoplankton cell size and pigment packaging 
from space using NDT’s. Adjustments in chlorophyll-specific phytoplankton absorption coefficients may 
be required in both N-replete and Fe-limited waters. Regarding nutrient-replete, light-limited regions 
such as the Antarctic Circumpolar Current [Mitchell et al., 1991], the SST minus NDT index includes the 

effect of seasonal changes in solar radiation through changing SST over the seasonal cycle [Ohlmann and 
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Siegel, 2000]. 

Since global biogeographic boundaries coincide with nitrate depletion temperature boundaries, 
our NDT scheme for adjusting a* ph (A.) seems to work even though factors other than nitrate can influence 
pigment packaging. Decreased abundances of cyanobacteria, a major component of the picoplankton 
community, in high latitude waters with surface temperatures less than 10°C [Murphy and Haugen , 1985] 
is largely responsible for the increase in average cell size that occurs from subtropical to polar waters 
[Odate and Maita, 1988], Increased cell size contributes to increased pigment packaging, thus requiring 
lower chlorophyll-specific phytoplankton absorption coefficients. 

8.2 Validation of Chlor_a_2 and Chlor_a_3 

With the reprocessing of MODIS Terra radiance data (version 041) and the availability of 
additional match-up data from the SeaBASS data archive [Werdell and Bailey, 2002] a second validation 
analyses was performed using match-up data compiled and made available by the NASA SIMBIOS 
program. Estimates of Chor_a_2 and Chlor_a_3 were derived using both shipboard in situ radiances (Fig. 
27) and MODIS Terra radiances (Fig. 28) and compared to in situ chlorophyll a concentrations (Table 9). 
MODIS Terra SST data and updated nitrate depletion temperatures from the previous section (Fig. 25) 
were used with the Chlor_a_3 algorithm. 

Note that exclusion of an apparently erroneous data point from the February 2002 Plumes and 
Blooms experiment in the Santa Barbara Channel, in which the field chlorophyll value was a factor of ten 
times lower than both the in situ- and satellite- retrieved chlorophyll concentrations, significantly 
improves accuracies for both algorithms. Vertical profiles of chlorophyll fluorescence collected at a 
nearby station indicate that chlorophyll concentrations at ~20m were almost ten times higher than surface 
values in this region, perhaps indicating that this high chlorophyll layer was detected by the radiometers 
but was not sampled for the in situ measurement. Three additional data points from the Secret experiment 
(S201 and S301) were also omitted from the original data set (n=31) provided by the SIMBIOS Team 
because in situ radiances were not available. 

Chlor_a_2 has a linear RMS error of 52.0% when derived from field radiance data. This is due in 
part to inaccurate slope (0.84), bias (-0.090), and scatter (r 2 = 87%) factors for this log-log plot. 
Chlor_a_3, however, has a much lower linear RMS error of 25.3%, due in part to more accurate slope 
(1 .01) and bias (0.002) terms, and less scatter (r 2 = 96%) factors for this log-log plot. 

The 35% linear accuracy goal for chlorophyll a [e.g. McClain et al., 1998; Kilpatrick et al., 2002] 
can be met with Chlor_a_3 given accurate water-leaving radiance retrievals. The 5%- accuracy goal for 
satellite ocean-color radiances [McClain etal. ,1998; Gordon and Voss ATBD 1999] is probably the best 
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In situ chlorophyll 

Figure 27. Relationships between in situ [Chi a] (mg m' 3 ) and a) Chlor_a_2 (mg m" 3 ) and b) Chlor_a_3 (mg m' 3 ) derived from 
shipboard in situ radiance data. One erroneous point (+) from the February 2002 Plumes and Blooms experiment was omitted 
due to strong vertical structure in chlorophyll. Tire dashed line is the one-to-one line. 


that can be achieved for SeaWiFS even with 3x3 pixel smoothing due to digitization errors [Hu et al., 
2001]. Digitization noise will not limit the accuracy of MODIS in the same way because it uses 12-bit 
rather than 10-bit digitization, but calibration and atmospheric-correction accuracies can be limiting. Even 

with accurate field radiances, however, Chlor_a_2 does not achieve the 35% goal using the global 
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Figure 28. Relationships between in situ [Chi a] (mg m" 3 ) and a) Chlor_a_2 (mg m' 3 ) and b) Chlor_a_3 (mg m" 3 ) derived from 
MODIS Terra (Collection 4, reprocessing 041) radiance data. One erroneous point (+) from the February 2002 Plumes and 
Blooms experiment was omitted due to strong vertical structure in chlorophyll. The dashed line is the one-to-one line. 

database and algorithm parameterization presently available. 

The radiances from MODIS Terra were compared to ship-borne radiance data retrieved from the 
SeaBASS web site for the same data points (n=27) used for the chlorophyll algorithm validation (Figs. 27 
and 28). The highest RMS error was found for the shortest blue waveband (e.g. 65.8%, 18.19%, 15.6%, 
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Table 9. RMSi; n errors for Chlor_a_2 and 
Chlor_a_3 (n=27) derived from MODIS and field 
water-leaving radiances (Collection 4, 


Radiance source 

Chlor a 2 

Chlor a 3 

Field 

0.520 

0.253 

MODIS 

0.563 

0.483 


Table 10. RMSi; n errors for MODIS water-leaving radiances 
(Collection 4, reprocessing 041), . 


Wavelength 

All R«(412)/ 
Rr( 443) 
(n=27) 

Rrs(412)/ 

R re (443)>0.5 

(n=24) 

412 

0.672 

0.302 

443 

0.174 

0.130 

488 

0.155 

0.146 

551 

0.240 

0.258 


and 24.5% for 412, 443, 488, and 551 nm, respectively) (Table 10). 

The extremely high error in normalized, water-leaving radiance (or remote-sensing reflectance) of 
the 412 nm band is mostly due to atmospheric correction. This conclusion is drawn from observations of 
a plot of the ratio of 412-to-443 nm bands versus the ratio of 443-to 551 nm bands (Fig. 29). Here the in 
situ radiance data cluster closely about the line developed by Carder et al. (1999) used as an initial 
separation of data points from waters with packaged and unpackaged pigments. Note that some of the 
MODIS radiance data line up with this line, some points line up parallel and below this line, and three 
data points depart at a sharp angle below the line. Note that excluding these three data points with 
R rs (412)/R re (443) ratios less than 0.5 reduces the MODIS radiance errors to values of 29.5%, 14.3%, 
14.7%, and 25.3% for 412, 443, 488, and 551nm, respectively. 

Hawes et al. (2000) found the same type of point departure due to absorbing Saharan aerosols for 
a SeaWiFS scene of the Mediterranean Sea. Blue-absorbing aerosols remove Rayleigh sky radiance as 
well as water-leaving radiance at a rate much larger than do non-absorbing aerosols. When a priori 
calculation of Rayleigh sky radiance is made without inclusion of the blue-absorption aspects of the 
aerosols and used to correct for Rayleigh sky radiance, too much Rayleigh radiance gets removed in the 
correction process. This most heavily impacts the 412 nm band, followed monotonically by the other 
bands. 
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Figure 29. Relationship between R fS (412)/R«(443) and R re (443)/R„(551) for in situ radiance 
data (open symbols) and MODIS Terra (Collection 4, reprocessing 041) radiance data 
(closed symbols). Radiance data with satellite measured J^(412)/R„(443) values greater 
than 0.5 (n=24) are represented with circles. Radiance data with satellite measured 
Rk( 4 12)^(443) values less than 0.5 (n=3) are represented with squares. The dashed line 
represents the Carder et al. (1999) function R fS (412)/R n} (443 = 0.95^(443)^(55 1)] 016 . 


This suggests a philosophy to exclude data points for which severe atmospheric-correction errors 
have occurred. Reducing the data quality or even eliminating data points having an Rrs(4 12)^(443) ratio 
value less than 0.5 removes them from consideration from the Level 0 or the highest quality data sets 
when binning data. This does not compromise actual Case 2 data as Carder et al. (1999) show that even 
waters with significant CDOM have field ratios of Rrs(4 12)^(443) typically larger than 0.6. Another 
option is to develop an equation for the data points parallel but below the Carder et al. (1999) line. Points 
falling below this line could be downgraded in quality or an alternative aerosol model for atmospheric 
correction could be utilized until values above the line are achieved. In this way a much narrower range of 
error is tolerated in the R and nLw(A,) data sets, and a less biased and more-Gaussian error distribution 
is achieved for radiances. 

We have evaluated the effects of the scatter inherent in these data on chlorophyll a retrievals. 
Since only Chlor_a_3 uses the 412 nm MODIS channel, and it has the highest RMS difference (67.2%) 
relative to field radiances, one would expect this inaccuracy of MODIS radiances to more deleteriously 
affect Chlor_a_3 than Chlor_a_2. Figures 27b and 28b illustrate this effect, as the linear RMS error for 
Chlor_a_3 doubles from 25.3% to 48.8% (Table 9), when switching from in situ to MODIS Terra 
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(Collection 041 ocean processing) radiance data. The RMS error for Chlor_a_2 only increased from 
52.0% to 56.3%. These latter values remain similar because the additional scatter in Fig. 28a offsets the 
bias in Fig. 27a. Different behavior of the same algorithm when applied to in situ or to satellite radiance 
data was also observed in the Mediterranean Sea by Ortenzio et al. (2002), demonstrating that 
atmospheric correction is the main source of error in ocean color. 

Note that removal of the three points with R re (412)/Rrs(443) less than 0.5 does not significantly 
change the Chlor_a_3 retrievals (rmsi in decreased from 0.483 to 0.474 for MODIS radiances and 
increased from 0.253 to 0.259 for field radiances). These changes were probably small because the field 
chlorophyll a values were high enough so that the semi-analytic algorithm had transformed into its 
empirical version above about 1 mg m' 3 . Thus, only the 488 and 551 nm wavebands were used. 

Modeling the error fields as the sum of the squares of the in situ-derived chlorophyll algorithm 
error (Table 9) and each MODIS radiance error for the channels used (Table 10) provides the following: 

RMS (chi a) = (£RMS(L(?0) 2 + RMS (Algorithm) 2 ) 0 5 (22) 

i 

This simple model basically assumes that the various sources of error are random and uncorrelated. 
Chlor_a_3 retrieval accuracies expected for MODIS for various MODIS waveband accuracies can be 
calculated using this expression. 

Using the error model (Eq. (22)), the 25.9% error for Chlor_a_3 retrievals derived from field 
radiance data (n=24) can be simulated for MODIS Terra data: [0.302 2 + 0.1 30 2 + 0.258 2 + 0.259 2 ] 0 5 = 
49.2%. This is within 2% of the MODIS retrieval accuracy number. 

If all of the MODIS radiances were only 14% in error, then the Chlor_a_3 retrievals would be 
expected to be accurate within about 35%. If all radiances were only 10% in error, then Chlor_a_3 
retrievals would have about a 30% error. On the other hand, even if MODIS Terra radiance retrievals 
were perfect (e.g. matched field radiances), Chlor_a_2 retrievals would not be more accurate than 48.9% 
for this data set. A not-unreasonable nLw(X) error distribution of 20%, 10%, 10%, and 10% for regions 
with little in the way of absorbing aerosols or for atmospheric-correction algorithms that are highly 
filtered would produce Chlor_a_3 accuracies of 35.6% using Eq. 22. 

Since the goal of the ocean color community has been to provide MODIS and SeaWiFS radiance 
accuracies of about 5% except perhaps for the 412 nm band [e.g. Gordon et al, ATBD 1999], the 
objective of providing a Chlor_a_3 algorithm capable of delivering accuracies better than 35% appears to 
be achievable using Chlor_a_3 with satellite retrievals of water-leaving radiance or remote sensing 
reflectance values accurate to better than 14% or for radiances accurate to 10% for all bands but a 412 nm 
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band within 20% accuracy. 

To achieve more accurate radiances, especially with a coastally dominated data set, application of 
better absorbing-aerosol models appears to be needed. In the meanwhile, numerical filters may be used to 
flag pixels containing highly absorbing aerosols [Hawes et al, 2000]. No amount of improvement in 
radiance accuracies, however, will allow Chlor_a_2 values to reach the goal of 35% accuracy for 
chlorophyll a. 

9.0 Conclusions 

A MOD1S semi-analytical algorithm for chlorophyll a was tested using a total of 976 global data 
points from regions where the pigments were typically unpackaged, packaged, or transitional with 
appropriate algorithm parameters applied for each data type. This resulted in “Version 1 " of the semi- 
analytic algorithm [Carder et al., 1999], which did not address the high-latitude regions with extremely 
packaged pigments (termed “fully packaged”) which are included in “Version 2" of the semi-analytic 
algorithm. The algorithm has thus evolved to better represent the entire global data set. 

The "unpackaged" type consisted of data sets that were generally consistent with the case 1 CZCS 
algorithm and contained mostly tropical, subtropical, and summer temperate data. This algorithm type 
was parameterized using Gulf of Mexico and Arabian Sea data, and it continues to remain the same in 
both Version 1 and Version 2 of the algorithm. The Version 1 "packaged" type of pigment-absorption 
parameterization was consistent with pigments found in eastern-boundary upwelling data sets containing 
somewhat more packaged pigments, but not consistent with the fully packaged pigments found in high- 
latitude data. Note that at times for upwelling centers, some of the phytoplankton have pigments that are 
as highly packaged as found in high-latitude data sets, but that is not true for the general region. The 
packaged data sets studied with Version 1 were processed with the algorithm modified for phytoplankton- 
absorption parameters consistent with the Southern California Bight area. This resulted in two fairly 
equally divided data sets totaling 604 points. That left 372 data points that were not well enough 
characterized to classify. 

The Version 1, semi-analytical (SA) algorithm for chlorophyll a performed well on each of the 
data sets after classification, resulting in RMS1 errors of 0.099 and 0.1 1 1 (e.g., 0.10 log unit) for the 
unpackaged and packaged data classes, respectively, with little bias and with slopes near 1 .0. RMS2 
linearized errors for the algorithms were 23% and 27% for the two classes, respectively. The SA 
algorithm for phytoplankton absorption provided data accurate to about 30%. 

While these numbers are excellent, one must bear in mind what misclassification does to the 
chlorophyll results. Using parameters for an average or transitional domain in the semi-analytical MOD1S 
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algorithm with the global data set («= 976) yielded an RMS2 error of 44.6%, while applying the 
unpackaged parameters on the global evaluation data set yielded an RMS2 error of 92%. So, without 
classification, the algorithm performs better globally using the average parameters than it does if 
misclassification occurs. 

For the difficult transition period between spring and summer or the regions between upwelling 
centers and the warmer offshore waters, a data set was tested that included the eastern boundary region of 
the North Atlantic. Nitrogen-depletion temperatures were used with AVHRR-derived sea-surface 
temperatures to sort stations into packaged, unpackaged, and transitional domains. Comparing sea- 
surface temperature (SST) to the nitrate-depletion temperature (NDT) provides a means of estimating 
how packaged were the pigments for a given pixel. Cold SST values relative to NDTs suggest convective 
overturning or upwelling has brought cold, nutrient-rich water to the surface. Phytoplankton in these 
waters are typically large, contain high ratios of chlorophyll pigment per cell because they have recently 
experienced very low light levels, and exhibit low chlorophyll specific absorption coefficients. Solar 
heating warms this water, reduces the cell requirement for light-harvesting pigments including 
chlorophyll a while increasing the cell requirement for photo-protective pigments, decreases the size of 
cells that can remain neutrally buoyant, and results in an increase in the chlorophyll specific absorption 
coefficient. 

RMS2 errors dropped from 50% to 38% as a result of this data-sorting exercise. Since large 
regions of the subtropical and tropical Atlantic, Pacific, and Indian Oceans remain in the unpackaged bio- 
optical domain during all seasons and provide stable data accuracies from 24% to 28%, it is reasonable to 
expect that use of an NDT-based sorting algorithm with MODIS sea-surface temperatures to separate data 
into appropriate bio-optical domains will result in accuracies for the MODIS semi-analytical chlorophyll 
a algorithm that are significantly lower than our target value of 35%. 

This completed the evaluation of mid- to low- latitude data sets considered for Version 1 of the 
algorithm and used in Collections 1 through 3 for ocean data found in the MODIS Data Active Archive 
Computer (DAAC) at NASA Goddard. Version 2 of the algorithm is discussed below and is used in 
Collection 4 of the ocean data found in the DAAC. Version 2 is summarized below. 

In Version 2, “fully packaged” pigment parameters were derived from high latitude polar and 
temperate upwelling data sets to replace the previously designated “packaged” parameter set. In addition, 
a smoother strategy for transitioning from the unpackaged to the fully packaged pigment domain was 
introduced using sea surface temperatures and nitrate depletion temperatures. These transitions can be 
thought of as transitions between fall and winter (fall overturn) and spring and summer (spring bloom) or 
as transitions between upwelling and offshore waters. 
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The 5° C range between unpackaged-pigment conditions (warm waters) and conditions of fully 
packaged pigments, provides a means of transitioning algorithm parameterization between the two 
extremes using the weighted average of chlorophyll retrievals from the two based upon SST-NDT values 
from -1° C to 4° C. Tests in the Southern California Bight transition region between upwelled and 
offshore waters indicate that errors are reduced from 39 to 3 1% using this method for transitioning 
between packaging domains. 

Compared to the SeaWiFS OC4 empirical algorithm, application of the MODIS Chlor_a_3 semi- 
analytical algorithm to Southern Ocean field data reduced the error in deriving chlorophyll concentration 
by almost a factor of two. Accuracy tests using MODIS satellite data (Collection 4, reprocessing 004) for 
November 2000 show that Chlor_a_3 retrievals of global mean chlorophyll a concentration are within 8% 
of the historical seasonal mean, while OC4-like empirical retrievals using Chlor_a_2 are but 2/3 of the 
seasonal mean. This suggests that Chlor_a_3 semi-analytical algorithm retrievals of chlorophyll a 
concentrations will lead to larger, more accurate pigment and subsequent global primary production 
numbers than are presently being retrieved using SeaWiFS and Coastal Zone Color Scanner data sets. 

Both the Chlor_a_3 and Chlor_a_2 algorithms used with MODIS Terra (Collection 4, 
reprocessing 004) data provide nearly identical modal chlorophyll values for the northern (0.087 mg m' 3 ) 
and southern (0.078 mg m" 3 ) central gyres in December 2000. However, the global mean value for 
Chlor_a_3 is higher (0.294 mg m' 3 ) than for Chlor_a_2 (0.231 mg m" 3 ). This difference can be explained 
by the Southern Ocean where Chlor_a_2 values are on average about half of field and Chlor_a_3 values 
due to lower chlorophyll-specific absorption coefficients typical of this region. MODIS Terra retrievals of 
chlorophyll a using reprocessed (004) Collection 4 data are accurate to better than 41%. Removing data 
points from continental shelves improves Chlor_a_3 accuracies to 21%, while promoting shelf data using 
more appropriate nitrate-depletion temperatures can improve accuracies to about 20%. 

A second match-up analysis was performed in late 2003, using MODIS Terra Collection 4 
(reprocessing 041) data. Chlorophyll-a retrievals from in situ radiances compared to in situ chlorophyll-a 
concentrations are accurate within 52% (linear RMS difference) for Chlor_a_2 and within 25% (linear 
RMS difference) for Chlor_a_3. This establishes the inherent algorithm accuracy if all field 
measurements were perfect. Chlorophyll-a retrievals using MODIS radiances versus in situ chlorophyll a 
values were only 56% and 48% for Chlor_a_2 and Chlor_a_3, respectively. While Chlor_a_3 is 
significantly more accurate than is Chlor_a_2 when accurate radiances are used, the 67% nLw(412) 
retrieval error that affects Chlor_a_3 and not Chlor_a_2 causes satellite-derived Chlor_a_3 retrieval errors 
to decline more rapidly than Chlor_a_2 retrieval errors compared to chlorophyll retrievals from in situ 
radiances. Improvements in sensor calibration and atmospheric correction are the primary improvements 


70 



needed to permit chlorophyll a concentrations from space to reach the chlorophyll accuracy goal of 35%. 
It can be reached with Chlor_a_3 if accuracies of water-leaving radiance for all bands are within 14%, but 
no amount of improvement of radiance accuracies will allow Chlor_a_2 values to reach the goal of 35% 
accuracy for chlorophyll a. 

The reduction in accuracy between chlorophyll a retrievals using data from reprocessing Versions 
004 and 041 may be accounted for in part by the increase in the fraction of points that are coastal. While 
this affects ship and satellite retrievals equally in terms of ocean constituents, it only affects the satellite 
retrievals in terms of aerosol constituents. Sea of Japan aerosols in the spring are often iron-rich from 
Gobi desert dust, while Southern California Bight aerosols are often carbon-rich from pollution. These 
absorbing aerosols would be greatly reduced were open ocean regimes better represented in the match-up 
data sets. Version 004 satellite data were averaged over a 5X5 pixel region to reduce the banding effects 
due to detector-polarization sensitivities at the edges of scenes, which also reduced any random noise 
effects by a factor of 5. To have achieved better than 25% accuracies from MODIS Chlor_a_3 retrievals 
from Version 004 suggests that with more open-ocean data match-ups, with a better filter for absorbing 
aerosols, and with the improved polarization corrections being implemented by Miami investigators (R. 
Evans, personal communication), there is optimism that MODIS Terra retrievals of chlorophyll a using 
Chlor_a_3 can reach the 35% accuracy goal of the ocean color community. 

10.0 Lessons Learned 

1 . Absorption effects of CDOM must be separated from that of phytoplankton to prevent 
overestimations in chlorophyll concentrations. The 412nm waveband is used in the semi-analytical 
algorithm to address this problem. The solution is based on the knowledge of the spectral shapes of 
CDOM and phytoplankton absorption coefficients. Chlor_a_2 retrievals in northern high-latitude 
waters rich in CDOM are generally higher than Chlor_a_3 retrievals most likely due to CDOM. 

2. Variability in the chlorophyll-specific absorption coefficients must be addressed. Comparing 
MODIS-derived sea surface temperatures with globally varying nitrate-depletion temperatures 
permits an empirical function to be developed that smoothly transitions chlorophyll-specific 
absorption coefficients from low values for high-latitude, winter-overturned, and upwelled waters to 
high values for tropical/subtropical and summer temperate waters. Chlor_a_2 retrieval inaccuracies 
are expected for upwelling and high-latitude regions, where underestimates compared to ship and 
Chlor_a_3 data can approach a factor of two. 
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3. Data sets rich in continental shelf data points should be scrutinized on a regional basis. If that is not 
possible, shelf data can be addressed using a default “global” parameter set that minimizes extreme 
excursions due to the package effect. 

4. Both empirical (Chlor_a_2) and semi-analytical (Chlor_a_3) algorithms are expected to be less 
accurate for layer systems such as river plumes and ice-melt plumes with monospecific blooms and/or 
vertical structure. 

5. Detector striping, mirror side-banding, and polarization effects all can contribute to errors in MODIS 
data, although the effects were reduced by at least a factor of five in Collection 4. These are more 
likely to be apparent at the edge of high-resolution scenes abutting adjacent scenes, where different 
view angles compound the differences observed in residual effects due to atmospheric and 
polarization corrections. The 5-km binning of match-up imagery for comparison to field data at scene 
edges reduces the detector-striping and side-banding effects, and residual effects are not apparent in 
weekly or monthly 39 km composites of data used in global primary production estimates. 

6. Although validation stations encompass a wide range of environmental settings, with chlorophyll 
concentrations ranging from -0.05 to 4.0 mg m' 3 , match-up data are heavily concentrated in coastal 
regions and many parts of the ocean (e.g. central gyre regions) are yet to be assessed. Initial results, 
however, are very encouraging, especially for upwelling (California Current and equatorial Pacific) 
and high-latitude (Antarctic) locations. 

7. Since better than 25% accuracies of MODIS Terra (reprocessing 004) Chlor_a_3 retrievals have been 
achieved, there is optimism that MODIS retrievals of Chlor_a_3 can reach the 35% accuracy goal of 
the ocean color community with a) more open-ocean match-up data, b) a better filter for flagging and 
possibly correcting absorbing aerosols that are detrimental to atmospheric correction algorithms, and 
c) the improved polarization corrections being implemented by Miami investigators (R. Evans, 
personal communication). 

8. The accuracy of chlorophyll a algorithms is only as good as the normalized, water-leaving-radiance 
data provided to it. Improvements in sensor calibration and atmospheric correction are the primary 
improvements needed to permit chlorophyll a concentrations from space to reach the chlorophyll 
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accuracy goal of 35%. It can be reached with Chor_a_3 if accuracies of water-leaving radiance for all 
bands are less than 14%, but no amount of improvement of radiance accuracies will allow Chlor_a_2 
values to reach the goal of 35% accuracy for chlorophyll a. 

9. Since errors associated with the empirical Chlor_a_2 algorithm applied to field radiances are 
significantly worse than 35%, no amount of error reduction in satellite-derived, normalized water- 
leaving radiances will reduce Chlor_a_2 retrievals to 35%. Thus, to achieve an accurate long-term 
climate-quality global data set for ocean chlorophyll-a concentrations and derived products such as 
primary productivity, it is strongly urged that the ocean community reprocess SeaWiFS data using 
AVHRR sea surface temperature data and the Chlor_a_3 algorithm. 

11.0 Output Products 

Output products from MOD 19 will include the following: 

1 . Concentration of chlorophyll a for concentrations from .02 to 50 mg/m 3 for optically deep waters. 

2. The absorption coefficient at 400 nm , ag(400) due to gelbstoff or colored, dissolved organic matter. 
All absorption coefficients ag(X) for 400 <X< 700 nm can then be estimated with knowledge of the 
spectral slope parameter S. 

3. The absorption coefficient at 443 nm, a^(443), due to phytoplankton; this is passed along to MOD20 
for calculation of a^(A.) for the visible spectrum as a contribution to the absorbed radiation by 
phytoplankton, ARP, used for fluorescence efficiency calculations. 

4. The sum of a w ( X), a g (/.), and a.,(X) provides the total absorption coefficient spectrum, a,(/0, and the 
diffuse attenuation spectrum, kd(A.) = a, (A.) / cos q 0 . See MOD20. 
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